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PREFACE FOR 
THE INSTRUCTOR 


This book evolved from a course that has been taught at Boston College 
for many years in many different forms. The course, taken primarily by 
sophomore mathematics majors, has always been intended to prepare those 
students for the ‘“‘abstract mathematics” of the title, for the rigor, careful 
argument, and logical precision that, although merely glimpsed in calculus, 
would be linchpins of all further study. Most students will have had two or 
three semesters of calculus and a semester of linear algebra so that they will 
have attained a reasonable degree of mathematical sophistication before 
starting this book. 

Our choice of topics is motivated by the fact that students take introduc- 
tory courses in abstract algebra and real analysis in their junior or senior 
years as mathematics majors. In order to be able to handle these courses 
effectively, they need to know the rules of logic and the rudiments of set 
theory as well as some basic properties of functions between sets: injective 
and surjective functions, image and inverse image, and invertible functions. 

The book divides naturally into two (overlapping, to be sure) parts. 


* Chapters 1-5 introduce the fundamentals of abstract mathematics. 
Logic, set theory, relations, functions, and operations lead to a careful study 
of one axiom system, the integers. These chapters form the core of the book. 
We always cover this material in our course, and we use it to introduce our 
students to the language of mathematics and to prepare them for their 
undergraduate careers in mathematics. 

© Chapters 6-8 apply the ideas and techniques of the earlier material. 
These chapters for the most part are independent, and we choose which 
ones to cover as time permits and interests lead. The topics, though, are 
ones that we feel our students should know, will not find completely alien, 
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and will stand them in good stead later: infinite sets and cardinality; the 
properties of the real and complex numbers; and the important properties 
of polynomials over those fields, such as unique factorization. 


Each section contains a wealth of examples and exercises. For the most 
part, especially in the early chapters, these concern familiar properties of 
the integers as well as some concepts from calculus, such as continuity and 
differentiability. (We do not define these latter topics rigorously, for they 
belong more properly in a course in real analysis.) A major goal of the text 
is for students to write mathematical proofs coherently and correctly, so we 
reinforce the logic developed in Chapter 1 throughout the book. In many 
exercises students must negate a statement or write a converse or a contrapos- 
itive. Some of the exercises build on the examples in the section, while others 
are a bit more challenging. We often leave some details of a proof as exercises, 
but only if, by then, students should find them routine. We think it is essential 
that instructors encourage (require?) students to fill in these gaps. 

We have also included some ‘“‘Discussion and Discovery Exercises” that 
are designed to have the students think a bit more deeply about what they 
have learned. In some of these exercises, they may propose a conjecture and 
then try to prove it or they may criticize a given proof for logical fallacies 
or lack of clarity. At the end of each section we have included ‘“‘Historical 
Comments” or a ‘‘Mathematical Perspective,’ which attempt to put some 
of the subject matter in a wider historical and mathematical context. We do 
not mean them merely as supplemental biographies but intend them to give 
students some sense of the kinds of problems mathematicians have struggled 
with over the centuries, from the time of Euclid to the present day. The 
topics in these commentaries are the authors’ personal choices, not meant 
to represent what every math major should know, but rather to be samples 
of some interesting material that can stimulate further study. Students can 
consult the list of references at the end of the book if they wish to learn more. 

Chapter 1 introduces the rules of logic that students need to know in 
order to understand written proofs and to write their own. This means 
that we discuss nuts and bolts: statements, propositions, quantifiers, rules of 
inference, and the like. Since this book is not designed for a full-fledged 
course in mathematical logic, we strive not to be too technical, while main- 
taining an appropriate level of rigor. The goal from the beginning is for 
students to write mathematics with the proper style and notation and with 
the correct ‘‘grammar”’ and connecting phrases, so that the end product is 
lucid, logical, and mathematically correct. 

Chapter 2 presents the rudiments of set theory: subsets, complements, 
union, and intersection, as well as infinite families of sets and partitions. An 
important goal of this chapter, in addition to a familiarity with the language 
of sets, is for students to learn how to prove (or disprove) that two sets are 
equal. Our experience tells us that students often have difficulty just begin- 
ning such a proof, so we provide many examples and exercises of this type. 
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(You will find such examples and exercises not only in this chapter, but also 
as they appear in context in other chapters.) If Chapter 6 is not covered, 
the instructor can omit Section 2.3 on infinite families of sets. 

Chapter 3 begins the study of functions as mappings between sets. We 
use the more informal definition of function as a rule rather than as a set 
of ordered pairs because it is an approach students are familiar with, involves 
no significant loss of rigor, and is the way functions actually are used. We 
discuss the notions of injective, surjective, and invertible functions thoroughly 
since they are basic to any understanding of abstract algebra. We give exam- 
ples involving image and inverse image since they are part of the language 
of analysis. We show how the tools of calculus (the Intermediate Value 
Theorem, for example) can be used to prove whether or not a function is 
invertible; we invite students to review some of the concepts from their 
calculus courses. We also want students to think in terms of sets of functions, 
so that binary operations on these sets will not seem so strange when we 
introduce them. 

Chapter 4 presents binary operations and relations, two relatively inde- 
pendent topics that are prerequisites for any abstract algebra course and for 
Chapter 5. Some of the exercises in Chapter 4 are similar to problems students 
do in a beginning abstract or linear algebra course, problems that are fairly 
routine but can be difficult for some students when seen for the first time. 
For example, a standard problem in group theory is to show that the intersec- 
tion of two subgroups of a group is itself a subgroup. This problem involves 
a special case of a problem in Section 4.1; namely, to prove that if two subsets 
of a set with a binary operation are closed with respect to that operation, 
then their intersection is closed. As an optional topic at the end of the 
chapter, we use equivalence relations to give a formal construction of the 
rational numbers from the integers. 

In Chapter 5, we bring together many of the ideas of previous chapters 
in the context of the set of integers. We begin with an axiomatic approach 
so that students can appreciate the distinctions among definitions, axioms, 
and theorems. We introduce mathematical induction in Section 5.2 as a 
consequence of the Well-Ordering Principle which is taken as an axiom. We 
prove the Unique Factorization Theorem in Section 5.4 via the standard 
approach of introducing greatest common divisors and properties of primes. 
Section 5.5 discusses congruences in some detail, defines the set Z,, of congru- 
ence classes of integers mod n, and proves its algebraic properties. Section 
5.6 gives the students an example of how mathematicians extend results: 
by trying special cases and then conjecturing suitable generalizations. The 
mathematics in this section is not difficult but it does give the students some 
exposure to the creative process. 

Chapter 6 introduces the theory of infinite sets and discusses countable 
and uncountable sets and cardinality. The mathematics is admittedly more 
difficult in this chapter and students should have developed a good mastery 
of the topics in Chapters 2 and 3. Sections 5.5 and 5.6 are not prerequisite 


xii 


PREFACE FOR THE INSTRUCTOR 


for this chapter. Induction (Section 5.2) needs to have been studied, of 
course; the Division Algorithm (Section 5.3) is mentioned in the proof of 
Theorem 6.1.8 and prime factorization (Section 5.4) in the proof of Theorem 
6.1.11. (Some alternative proofs of the latter don’t use prime factorization.) 

Chapter 7 introduces ordered fields and compares and contrasts such 
fields with the integers as developed in Section 5.1. Section 7.2 on the real 
numbers gives the students some of the flavor of a beginning real analysis 
course, presenting topics such as the Least Upper Bound Axiom, the Archi- 
medean Principle, and the fact that between any two real numbers are 
infinitely many rational numbers. Preparation for Section 7.2 does not require 
all of Section 7.1. Section 7.3 is a standard introduction to complex numbers 
including DeMoivre’s Theorem and roots of unity. Except for the definition 
of field, this section is independent of Sections 7.1 and 7.2. 

The goal of Chapter 8 is for students to learn some techniques for solving 
polynomial equations, a topic often neglected. The chapter begins with a 
study of polynomials over an arbitrary field and the relation between zeros 
of polynomials and their linear factors. We state and prove unique factoriza- 
tion and note analogies with the integers. We then discuss (without proof) 
the Fundamental Theorem of Algebra and give some methods for finding 
rational roots of a polynomial with integer coefficients. 

Unquestionably, we provide too much material for a single semester. 
The majority of the core, Chapters 1-5, is essential; instructors can choose 
topics from the later chapters as time permits or can include them in an 
honors version of the course. But the goal of the entire book is to show our 
students how mathematicians think as well as some of the fascinating things 
they think about. 
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INTRODUCTION 
FOR THE STUDENT 


This text and others like it are often described as transition books, primers 
for higher-level mathematics. What do we mean by a transition book and 
why is such a book necessary? 

By now, you have seen a significant amount of mathematics, including 
at least a year or two of calculus and possibly some linear algebra. The 
mathematics in these courses is quite sophisticated. Calculus, for example, 
as developed by Newton and Leibniz, is the greatest mathematical achieve- 
ment of the seventeenth century. The tremendous scientific advances of 
the last 300 years would not have been possible without the formulas and 
algorithms that follow from the theory of the integral and the derivative. 
Soon, you will take additional courses in such fields as probability, combinato- 
rics, dynamical systems, linear programming, or topology, to list just a few 
examples. Given that calculus involves such high level mathematics, why 
does a math major need a transition course? Why not just plunge right into 
these so-called advanced courses? 

One reason stems from the history of calculus itself. In the seventeenth 
and eighteenth centuries, mathematicians would manipulate infinite series 
much like ordinary finite sums. The results were usually quite correct, but 
the methods often led to errors. Here is an example: 

The MacLaurin series expansion of In(1 + x) is given by: 


x x4 


x 
Inlt+xy=x-Z+Q-7te.. (*) 


This series converges for —1 < x = 1. Differentiating both sides of (*) gives: 


1 
1+x 


=1-xt+x-x34+.... (**) 
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If we substitute x = 1 in (**), we get 


1 
-=1-1+1-1+.... 
7 1-1+1-1 


The right-hand side of the equation, however, is not a convergent series. 
What has gone wrong here is the indiscriminate differentiation of a power 
series, term by term, as if it were the same as a finite sum. Sometimes this 
can be done and sometimes it cannot. In fact, (**) is a true equation for all 
x such that —1 < x < 1. What becomes important is to prove under what 
conditions a power series can be differentiated term by term. 

Another example is provided by letting x = 1 in (*) above, giving the 
true equation: 


In2)=1—34+5- te 5) 


iu 
4 

Now if we rearrange the terms of the infinite series on the right-hand 
side, we obtain the equation: 


Since we know that In(2) # 0, we have an apparent contradiction. The 
contradiction is resolved by noting that the right-hand side of (***) is a 
conditionally convergent series; that is, the series converges but if the terms 
of the series are replaced by their absolute values then the resulting series 
diverges. It can be proven that a rearrangement of a conditionally convergent 
series will not necessarily converge to the same sum as the original series. 
In fact, a conditionally convergent series can be rearranged to converge to 
any given number or even to diverge. 

In both of these examples, mistakes are made by treating an infinite sum 
the same as a finite sum. In trying to determine which rules that apply to 
finite sums also apply to finite series, it is necessary first to define carefully 
what we mean by an infinite series and then prove properties of series 
using that definition. As each property is verified, it can be used to prove 
subsequent properties. 

Mathematicians of earlier centuries commonly manipulated formulas and 
symbols indiscriminately without regard for whether or not those manipula- 
tions were justified. Nevertheless, often these ‘‘missteps’’ actually led to 
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true formulas or provided insights into why something was true. The great 
mathematician Leonhard Euler (1707-1783) is famous for making discover- 
ies in a totally nonrigorous way. Here is an example. 


You may recall that the infinite series S — is a convergent series by the 


n=1 


so-called p-test. But knowing that the series converges does not tell you to 
2 


; : f 7 
what number the series converges. In fact, this series converges to —. 


6 
Euler’s “proof” of this fact goes like this: the MacLaurin series expansion 
of sin x is 
met gl 
ae le ae Tee 
Dividing by x gives the equation: 


sinx _ x? 


x —3I° 5! 7! 


sin x 
If we set —— = 0, the roots are: +7, +27, +37,.... 
x 


If we treat the infinite series as if it were a polynomial, as Euler did, then 
we can factor it as 


(Dl Dl-a)lt a) s la) ~ 


since this infinite product has the same roots and the same constant term as 
the infinite series. 


So we get: 

sin x XP AO xe 
Se lee Se Sat 
x 3! S! 7! 


“CFO +0 -a Ora) O alls) ~ 
(5) a) -38)-- 


If we multiply out this last infinite product, as we would a finite product, 
we see that the coefficient of the x? term is the infinite series 


On the other hand, the x? term of the MacLaurin series is — 4 =— *. 
Multiplying both expressions by —7’ gives us Euler’s result. 
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We emphasize that Euler was not indifferent to the idea of convergence 
of an infinite series and, of course, he knew that a power series was not 
the same as a polynomial. His insight and cleverness produced significant 


: . V1 7 f . 
mathematics. He was later able to give a proof that » ars that is consid- 
n=1 
ered rigorous by today’s standards. 
One of the important byproducts of finding a rigorous proof of a mathe- 
matical theorem is that it can often lead to new results or even to generaliza- 
tions of the theorem, generalizations that may be impossible to discover by 


informal methods such as the ones employed by Euler. 


For example, if you change the exponent in the series > . from a 2 to 


n=1 
another integer such as 3, 4... and so forth, you may well ask to what 
numbers these different series converge. There is a long history of attempts 
to answer this question. 


© 


; : 1 
First, we define a function (s) = > 2 for s a real number greater than 


n=1 
2 
1, so that ¢(2) is the series we considered above and ¢(2) = a (Note: the 


symbol ¢ is the Greek letter zeta and the function ¢(s) is known as the 
Riemann zeta function, named after the mathematician Bernhard Riemann 
(1826-1866). You may recall the name Riemann from the study of Riemann 
sums in calculus.) Euler derived a formula for ¢(s) whens is an even positive 
integer. The formula involves a power of 7 and the so-called Bernoulli 
numbers, which we won’t define here. No formula for ¢(s), when s is an odd 
positive integer, is known. In 1978, the French mathematician R. Apery 
proved that £(3) is irrational, but not much else is known about these num- 
bers. This and countless other examples show that mathematics is not a 
closed subject. Many unsolved problems and even new areas of mathematics 
await the budding mathematician. 

A transition book such as this one, then, is an introduction to the logic 
and rigor of mathematical thinking and is designed to prepare you for more 
advanced mathematical subjects. 

We designed our course and this book with three goals in mind. First 
and foremost of these is to show you the elements of logical, mathematical 
argument, to have you understand exactly what mathematical rigor means 
and to appreciate its importance. You will learn the rules of logical inference, 
be exposed to definitions of concepts, be asked to read and understand 
proofs of theorems, and write your own proofs. At the same time, we want 
you to become familiar with both the grammar of mathematics and its style. 
We want you to be able to read and construct correct proofs, but also to 
appreciate different methods of proof (contradiction, induction), the value 
of a proof, and the beauty of an elegant argument. 

A second goal is for you to learn how to do mathematics in a context, 
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by studying real, interesting mathematics and not just concentrating on form. 
We have chosen topics that do not overlap significantly with other courses 
(such as the properties of the integers, the nature of infinite sets, and the 
complex numbers), that are essentially self-contained, and that will be useful 
to you later when you are exposed to more specialized, advanced mathe- 
matics. 

Finally, we want you to realize that mathematics is an ongoing enterprise, 
with a long, fascinating, and sometimes surprising history. The notes sprin- 
kled throughout the text are deliberately eclectic. ‘“Historical Comments” 
give you pictures of the tremendous successes (and equally spectacular fail- 
ures) of brilliant mathematicians of the past. “Mathematical Perspectives”’ 
may spotlight questions that are still unanswered and are the subject of 
current research, or that simply show an interesting further aspect of the 
material you have just studied. 


MATHEMATICAL 
REASONING 


Introduction: Early Mathematics 


Mathematics in one form or another has probably existed in every civilization. 
Many cultures flourished in the area of the Tigris and Euphrates rivers known 
as Mesopotamia in what is now Iraq. The Babylonian civilization of that 
area has left us records of mathematical activity as far back as the years 
1800-1600 B.c. Writing on clay tablets, they recorded solutions of algebraic 
problems and compiled tables of squares, cubes, square roots, and cube 
roots, even some logarithms. They also listed Pythagorean triples, numbers 
such as 3, 4, 5 and 5, 12, 13, which make up the sides of a right triangle 
(using of course their own symbols and number system). So the Babylonians 
knew the Pythagorean Theorem more than a thousand years before the 
time of Pythagoras! Even earlier than the Babylonians, the so-called Middle 
Kingdom of Egypt (2000-1800 B.c.) produced some sophisticated calculations 
of areas and volumes. One of their great achievements was the calculation 
of the volume of a truncated pyramid with square base and square top. (See 
Exercise D4 of Section 1.3.) 

Beginning in about the sixth century B.c. in Greece, an extraordinary 
new chapter in the history of mathematics began. For the first time (at least 
as far as we know) the methods of reasoning and logical deduction, as 
opposed to trial and error calculations, were employed to arrive at new 
mathematical truths. Prior to the Greeks, in Egypt and Babylonia for exam- 
ple, geometric results and algebraic formulas were discovered by empirical 
methods; that is, by trying special cases and then extrapolating to the general 
case. This empirical method, called inductive reasoning, has been used by 
mathematicians throughout history. It is not merely a perfectly acceptable 
method of making new discoveries but is really an indispensable way to 
arrive at new results. But the Greeks insisted that any new results had to 
be proved and that meant using the rules of logic. This method is called 
deductive reasoning. In this chapter we introduce some of the important 
ideas of logic that are frequently used in mathematics. 
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STATEMENTS 


The Notion of Proof 


Contrary to popular opinion, mathematics is not just computations and equa- 
tions. It might be better described as an attempt to determine which state- 
ments are true and which are not. The subject matter may vary from numbers 
to geometric figures to just about anything, but the form is always the same. 

The process of discovery in mathematics is twofold. First comes the 
formulation of a mathematical statement or conjecture. This formulation 
often comes after much hard work that usually includes a trial-and-error 
process, many false starts, and sometimes extensive calculation. The second 
part of the process is the verification or proof that the statement that we 
have formulated is true or false. This part too can involve much trial and 
error and long, hard work. It is this part of the process that we will study 
in this text. 

To begin to prove a mathematical statement it is necessary to begin with 
certain statements that we accept as given, called axioms, and try to logically 
deduce other statements from them. These deductions are called proposi- 
tions, or, if they’re particularly important, theorems. The arguments, the 
logic we use to make the deductions, are called proofs. 

The beauty of mathematics often comes from the fact that propositions 
are not always obvious, and can in fact be surprising. Moreover, proofs may 
not be easy to construct, and may require insight and cleverness. They 
certainly require a little experience, especially some familiarity with the most 
commonly used logical methods. This “‘sophistication’”’ won’t come overnight; 
it’s one of the major goals of this book. 

Let’s begin with an example. 


Suppose that the following question were posed to you: Is the square of an 
even integer itself an even integer? You might start by trying some calcula- 
tions: 


2=4, a = 16, 6 = 36, 8 = 64, 10° = 100. 


Each one of these examples gives a positive answer to the question. Do 
these answers constitute a proof? 

If the problem asked us to show that the squares of some even integers 
are even, we would be done. But that is not the question. There is an implied 
universality about the question. Is the square of every even integer even? 
Clearly, more must be done than trying a few examples. In fact just working 
out examples will not suffice, because no matter how many we do, there will 
always be an infinite number of cases that we haven’t done. 

So how do we proceed? The first step might be to reword the question 
as a statement. 
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“The square of every even integer is even.” 
Now let’s reword the statement using symbols: 
“Tf n is an integer and n is even, then n’ is even.” 


In order to begin a proof, we have to ask ourselves: What does it mean 
for an integer to be even? Well, we know that an integer is even if it is two 
times an integer. That is the definition of even integer. 

Now we need to translate this definition into symbols: 


“yn is an even integer if nm = 2m for some integer m.” 


Now let’s think about what we’re trying to prove. We want to show that 
the square of the even integer v is even. So we start by letting n be an even 
integer and try to show that n’ is twice an integer. We do the obvious algebra 
step; we write n = 2m and square both sides. 

We get n> = 4m’ = 2(2m’). Since 2m’ is an integer, this shows that n? is 
twice an integer and therefore n’ is even. 

The proof is now complete. 

This is an example, admittedly not a complicated one, of the proof of a 
mathematical statement. 


Note that in this last example, we made some assumptions about multipli- 
cation of integers; namely, that multiplication satisfies a commutative law 
and an associative law and that the product of integers is still an integer. 
These laws make up some of the axioms of the integers and will be discussed 
in Chapter 5. For the purpose of examples in this and later chapters, we 
will assume the well-known arithmetic properties of the integers and real 
numbers. (See Section 5.1 for properties of the integers and Section 7.1 for 
properties of the real numbers.) 

The notion of multiples of an integer is used in some examples. An 
integer x is called a multiple of an integer n if x = kn for some integer k. 
So the even integers are the multiples of 2. The multiples of 3 consist of the 
integers 0, +3, +6, +9, and so on. 


Statements 


In the course of this chapter, we will discuss many of the rules of logic and 
inference needed in the previous example and others like it in mathematics. 

In the previous example, we used the word “‘statement”’ several times. 
In this text, the word will have a specific meaning. 


Definition |.1.1 A statement is any declarative sentence that is either 
true or false. 
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EXAMPLE 3 


EXAMPLE 4 


EXAMPLE 5 
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A statement then will have a truth value. It is either true or false. It 
cannot be neither true nor false and it cannot be both true and false. 
Some examples of statements are given next: 


John Fitzgerald Kennedy was the 35th president of the United States. 
Marie Curie did not win the Nobel Prize. 
34+1=4. 


The fourth example is a mathematical statement, which of course is true. 
We commonly write mathematical statements with symbols for convenience, 
although you should think of them not as ‘‘formulas,” but as full-fledged 
sentences, with a subject, a verb, and possibly other parts of speech. 

In the rest of this chapter, we examine what mathematical statements 
can look like and some methods that can be used to prove them. For conve- 
nience, we often use letters, most often P or Q, to denote statements. 


Some sentences, even some mathematical ones, are not statements. For 
example, consider a typical sentence from algebra: x + 1 = 2. Here, x isa 
variable; it’s a symbol that stands for an undetermined number. The sentence 
is a statement if we specify what number x stands for. It’s a true statement 
if x stands for 1, and it’s false for any other x. We could label this sentence 
P(x) because it depends on the variable x. So P(1) is a true statement and 
P(x) is a false statement if x # 1. 
Note, however, that if x is not specified, then P(x) is not a statement. 


We will call any sentence like the one from the previous example an 
open sentence. 


Definition 1.1.2 An open sentence is any declarative sentence con- 
taining one or more variables that is not a statement but becomes a 
statement when the variables are assigned values. 


The values that can be assigned to the variables of an open sentence will 
depend on the context. They may come from the real numbers as in the 
example x + 1 = 2 or from the complex numbers or even just the positive 
integers. The values do not even have to be mathematical. For example, the 
sentence ‘“‘He was the 16th president of the United States” is an open 
sentence containing the variable ‘“‘he”’ and is therefore a true statement when 
“he” is assigned the value ‘‘Abraham Lincoln”’ and is false otherwise. 


EXAMPLE 6 


EXAMPLE 7 
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An open sentence is usually written P(x), P(x, y), P(x, y, z), and so on, 
depending on the number of variables used. 


Quantifiers 


An open sentence like x + 1 = 2 can, as we have seen, be made into a 
statement by substituting a value for the variable or, in the case of an open 
sentence with more than one variable, by substituting a value for each of 
the variables. 

Another way an open sentence can be made into a statement is by 
introducing quantifiers. For example, for the open sentence x + 1 = 2, we 
could say: For every real number x, x + 1 = 2. This sentence is now a 
mathematical statement that happens to be false. The quantifier introduced 
here is the phrase “for every real number x” and is called a universal 
quantifier. Another way to modify P(x) is to write: there is a real number 
x such that x + 1 = 2. Note that this statement is true. The quantifier in 
this example, “‘there is a real number x,” is called existential. 

Once a quantifier is applied to a variable, then the variable is called a 
bound variable. In the example “‘For every real number x, x + 1 = 2” of 
the previous paragraph, then, the variable x is a bound variable. A variable 
that is not bound is called a free variable. 

If P(x) is an open sentence, then the statement: ‘‘For all x, P(x)” means 
that for every assigned value a of the variable x, the statement P(a) is true. 

The statement ‘“‘For some x, P(x)’ means that for some assigned value 
of the variable x, say x = a, the statement P(a) is true. This statement may 
also be worded: ‘‘There exists a value of x such that P(x).”’ 

Sometimes, in a statement containing universal quantifiers, the words 
“for all’ or “for every” are not actually in the sentence but are implied by 
the meaning of the words. Here are some examples. 


If n is an even integer, then n’ is even. 


On the surface, this sentence might seem to be an open sentence rather 
than a statement since it contains the variable n. However, implicit in the 
wording is the meaning: for every integer n, if n is even, then n’ is even. So 
the variable n has been modified by a universal quantifier and is now a 
bound variable, making the sentence a statement. As we saw in Example 1, 
it is actually a true statement. 


A triangle has three sides. 


This statement contains a universal quantifier since it is really asserting 
that every triangle has three sides. Another way to word this statement is 
‘“‘For every plane figure 7, if T is a triangle, then T has three sides,’’ or more 
simply “If T is a triangle, then T has three sides.” 


EXAMPLE 8 


EXAMPLE 9 


EXAMPLE 10 


EXAMPLE || 


EXAMPLE 12 


EXAMPLE 13 
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The square of a real number is nonnegative. 


Again this statement has a universal quantifier since it is saying that the 
square of every real number is nonnegative. It could also be written as: 


“If x is a real number, then x? = 0.” 
All triangles are isosceles. 
This statement has a universal quantifier as well. It just happens to be false. 


We will sometimes use the symbol V to mean “‘for all” or “‘for every.” 
Example 8 can be rewritten: ‘“‘V x, if x-is real, then x? = 0.” 

The following examples give some of the forms that a statement with an 
existential quantifier can take. 


Some even numbers are multiples of 3. 


First note that even though this statement is written in plural form (‘“‘some 
even numbers’’), it may be phrased: ‘“‘There exists an even integer that is a 
multiple of 3.’ To prove this statement, one need only find one even number 
that is a multiple of 3. Since 6 is such a number, the proof is complete. 


Some real numbers are irrational. 


This statement asserts something about some, but not all, real numbers. 
It may be reworded as: ‘“There exists a realnumber x such that x is irrational.” 
It is a true statement provided that there is at least one real number that is 
not rational. In Section 1.4, we will prove this statement by showing that 
V2 is irrational. 


There is a real number whose square is negative. 


This statement also makes an assertion about some real numbers. Note 
that it is a false statement. 


The symbol J is used to mean ‘“‘there exists’’ or “‘there is.” The symbol 
> is read “‘such that.’”” Example 11 then can be expressed as: J x, x real, 
> x is irrational. 

A statement of course may have more than one quantifier. 


For every real number x, there is an integer n such that n > x. 


This statement contains both a universal and an existential quantifier. 


EXAMPLE 14 


EXAMPLE [5 
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The following statement, which is a definition from calculus, also has both 
a universal and an existential quantifier: A real-valued function f(x) is 
bounded on the closed interval [a, b] if f(x) is defined on [a, b] and there 
exists a positive real number M such that |f(x)| = M for all x € [a, b]. 

For example, the function f(x) = x? + 1 is bounded on [0, 2] because 
|f(x)| = 5 for all x € [0, 2]. 


The order in which quantifiers appear in a statement is important. If 
P(x, y) is an open sentence in the variables x and y, then the statement 


Vx, dy 3 P(x, y) 

does not always mean the same as the statement 
dy 3 Vx, P(x, y). 
To see this, consider the following example. 


The statement “Every real number has a cube root’ can be written in 
the form: 


V real numbers x, J a real number y 3 y? = x. 


This statement is true and is a consequence of the Intermediate Value 
Theorem of Calculus. However, the statement 


da real number y 3 V real numbers x, y? = x 


means that every real number x is the cube of a single number y and is 
clearly false. 


Negations 


Two of the statements just given, “‘All triangles are isosceles”. and ‘“There 
is a real number whose square is negative,’ were noted to be false. To 
prove that they are false it is necessary to prove that the negations of these 
statements are true. 


Definition I.1.3 If P is a statement, the negation of P, written —P (and 
read “‘not P’’), is the statement ‘‘P is false.” 


There are several alternative ways to express —P. For example, ‘‘P is not 
true” and “It is not true that P”’ are the same as our definition. In addition, 


EXAMPLE 16 


EXAMPLE 17 


EXAMPLE 18 


EXAMPLE 19 


EXAMPLE 20 


EXAMPLE 21 
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it is often possible to express the negation of a sentence more elegantly by 
using better English style. Note however that the negation of a statement P 
must be phrased in such a way that exactly one of the statements P or —P 
is true and one is false. Some examples follow: 


P: — John Fitzgerald Kennedy was the 35th president of the United 
States. 
=P: John Fitzgerald Kennedy was not the 35th president of the 
United States. 


Notice (it’s obvious!) that exactly one of P and —P is true, the other false. 


P: Marie Curie did not win the Nobel Prize. 
=P: Marie Curie won the Nobel Prize. 


We really have a double negative here, the negation of a negatively 
phrased statement, that is better stated positively. 


As this example shows, symbolic mathematical statements can often be 
negated symbolically. 


We can also talk about the negation of open sentences P(x), P(x, y), 
P(x, y, Z), ... in one or more variables. To use the one-variable case as an 
example, we will write =P(x), read ‘‘not P(x),” to mean the open sen- 
tence in the variable x that becomes the statement —P(a) when x is as- 
signed the value a. 

The negation of open sentences with more than one variable is defined 
in a similar fashion. In writing such sentences, we use whatever form good 
English or mathematical usage would dictate. 


P(x): x +1=2. 
aP(x): «+1 #2. 


P(x): x >5. 
aP(x): x s5. 


P(n,m): n+ mis even. 
=P(n,m): n+ mis odd. 


In the next example, we consider the negation of a statement with a 
universal quantifier. 


EXAMPLE 22 


EXAMPLE 23 


EXAMPLE 24 


EXAMPLE 25 
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P: If n is an even integer, then n? is even. 
=P: It is not true that if 7 is an even integer, then n? is even. 


Every statement can be negated by simply putting the phrase “‘it is not 
true that” in front of it. Usually, though, expressing the negation this way 
does not convey what the negated statement actually means. 

In this example, P contains the universally quantified variable n and can 
be written: 


For all integers n, if n is even, then n’ is even. 


Therefore, P is false if there is some even integer whose square is not 
even. So —P must be the statement 


For some even integer n, n? is not even or 
There exists an even integer n such that n? is odd. 


Note that in negating P, the universal quantifier is replaced by an existen- 
tial quantifier. 


P: Every day this week was hot. 


The negation of this statement is not ‘‘Every day this week was not hot,” 
since these two statements do not cover all possibilities. There can be a week 
in which some days are hot and some are not. To negate P, it is sufficient 
that there be at least one day that is not hot. So we can write the negation as: 


Some day this week was not hot or 
There was a day this week that was not hot. 


P: Every polynomial function is continuous everywhere. 
aP: There exists a polynomial function that is not continuous some- 
where. 


There are really two universal quantifiers in statement P. It says that all 
polynomial functions are continuous at all real numbers. So the negation 
should say that some polynomial functions are not continuous at some 
real numbers. 


The negation of the statement ‘“‘All Red Sox players are slow” is “‘There is 
a Red Sox player who is not slow,”’ or equivalently, ‘““Not all Red Sox players 
are slow.”’ An incorrect way to negate this statement is ‘‘All Red Sox players 
are not slow.” We make no comment about which of these statements is true. 


If we negate a statement with an existential quantifier, then a universal 
quantifier is required. 


10 


EXAMPLE 26 


EXAMPLE 27 


EXAMPLE 28 


EXAMPLE 29 
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P: — There is a real number whose square is negative. 


Statement P says that we can find a real number whose square is negative. 
The negation of P means that we cannot find such a number. In other words, 
the negation of P is 


The square of every real number is not negative. 
To negate P by saying “There is a real number whose square is nonnega- 
tive’’ would not be correct because this statement and P could, theoretically 


at least, both be true. 


P: Some real-valued functions are not integrable. 
AP: Every real-valued function is integrable. 


As Examples 22-27 show, there are thus two basic rules about negating 
statements with quantifiers. 


Rule 1: The negation of the statement ‘‘For all x, P(x)” is the statement 
“For some x, —P(x).” 


Rule 2: The negation of the statement ‘‘For some x, P(x)” is the state- 
ment ‘For all x, —P(x).” 


Of course, if a statement contains both universal and existential quantifi- 
ers, then in order to negate the statement, it is necessary to apply both of 
these rules. 

If astatement S has the form ‘“‘For all x, Jy 3 P(x, y)” then the negation 
of S is “For some x, =(Ay 3 P(x, y))” by Rule 1. Since Rule 2 tells us that 
the negation of “‘Ay 3 P(x, y)” is ‘For all y, =(P(x, y)),” then the negation 
of S becomes ‘“‘For some x, for all y, =(P(x, y))”’ or “Ax 3 for all y, =(P(x, y)).” 

Similarly, the negation of “Ax 3 Vy, P(x, y)’” is “Wx, dy 3 A(P(x, y)).” 


P: For every real number x, there is an integer n such that n > x. 
=P: There is a real number x such that for every integer n,n = x. 


The statement in this last example is known asthe Archimedean Principle. 
See the Historical Comments at the end of Section 1.2. 


P: There is a continuous real-valued function f(x) such that f(x) is 
not differentiable at any real number c. 
oP: For every continuous real-valued function f(x), there is a real 


number c such that f(x) is differentiable at c. 
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Surprisingly, statement P of Example 29 is true. There are continuous 
functions that are not differentiable at any point! An example, due to K. 
Weierstrass, is 


f(x) = > 508 ((15)" wx). 


This is a function that is continuous everywhere but differentiable nowhere. 


Writing Proofs 


As you start to read and write proofs, you will see that they require a certain 
writing style to ensure clarity and readability. Take, for example, the proof 
in Example 1, that if n is an even integer, then n’ is even. Suppose that a 
proof were written as follows: 


n = even = 21. 
n> = 4¢ = even. 


This rather brief proof has the correct mathematical steps, but is lacking 
in explanation and hence in clarity and also suffers from poor notation. One 
should begin by clearly stating the assumption: ‘‘Let n be an integer and 
suppose that 7 is even.” Then it can be noted that this means that ‘‘n = 2t 
for some integer ¢.”’ But writing ‘“‘n = even” is sloppy notation. The word 
“even” is an adjective and should precede a noun; in this case, the word 
“number.” But even the expression ‘“‘n = even number” is not appropriate. 
The phrase ‘“‘even number”’ does not belong in an equation. Equations should 
only contain numbers and symbols. 

After writing ‘‘n = 2t for some integer ’’ an explanation should be given 
for the next step: “Squaring both sides, we get n’ = 4¢° = 2(2r’).” Then 
finally, one should note that “‘since 27’ is an integer, it follows that n’ is even.” 


“» HISTORICAL COMMENTS: EARLY GREEK MATHEMATICS 


A history of early Greek mathematics was written by Eudemus in the fourth 
century B.c. Although this book is now lost, a summary of it was written by 
Proclus in the fifth century a.p. According to Proclus’s work, the earliest 
known mathematician to use the deductive method was Thales of Miletus. 
Thales founded the earliest Greek school of mathematics and philosophy. 
Among the results attributed to Thales are: the base angles of an isosceles 
triangle are congruent; triangles with corresponding angles equal have pro- 
portional sides; and an angle inscribed in a semicircle is a right angle. Each 
of these results was proven by deductive methods. 

About sixty to eighty years after Thales, an important school of mathe- 
matics and philosophy was founded in southern Italy by Pythagoras (ca. 
585-497 B.c.). One of the great contributions of the Pythagoreans, as the 
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members of this school were known, is the idea that mathematical entities 
such as numbers and geometrical figures are in fact abstractions distinct from 
the material world. To the Pythagoreans the concept of number was the key 
to understanding the mysteries of the universe. By numbers they always 
meant whole numbers. Although they worked with ratios of whole numbers, 
which we call fractions, they did not consider them numbers per se. The 
Pythagoreans are credited with discovering the Pythagorean Theorem but 
probably did not give a formal proof of it. And as the records left behind 
by the Babylonians indicate, the Pythagoreans were not the first to use it. 


Exercises I.1 


1. Determine whether each of the following sentences is a statement, an 
open sentence, or neither. 
(a) The Boston Celtics have won 16 NBA championships. 
(b) The plane is leaving in five minutes. 
(c) Get a note from your doctor. 
(d) Is that the best you can do? 
(e) Excessive exposure to the sun may cause skin cancer. 
(ff) 5+2=6. 
(g) Someone in this room is the murderer. 
(h) x? +140. 
(i) For every real number x, x? + 1 # 0. 
(j) The equation of a circle of radius 1 with center at the origin is x? + 
y=. 
(k) If n and m are even integers, then nm is even. 


2. For each of the following statements, determine if it has any universal 
or existential quantifiers. If it has universal quantifiers, rewrite it in the 
form “for all....’ If it has existential quantifiers, rewrite it in the form, 
“there exists... such that ....” Introduce variables where appropriate. 
(a) The area of a rectangle is its length times its width. 

(b) A triangle may be equilateral. 

(c) 8-8 =0. 

(d) The sum of an even integer and an odd integer is even. 

(e) For every even integer, there is an odd integer such that the sum of 
the two is odd. 

(f) A function that is continuous on the closed interval [a, b] is integrable 
on [a, b]. 

(g) A function is continuous on [a, b] whenever it is differentiable on 
[a, b]. 

(h) A real-valued function that is continuous at 0 is not necessarily 
differentiable at 0. 

(i) All positive real numbers have a square root. 

(j) 1 is the smallest positive integer. 
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. Write the negation of each statement in Exercise 2. 


. Write the negation of each of the following statements. 


(a) All triangles are isosceles. 

(b) Some even numbers are multiples of three. 

(c) Every door in the building was locked. 

(d) All new cars have something wrong with them. 

(e) Some angles of a triangle are greater than 90 degrees. 
(f) There are sets that contain infinitely many elements. 


. Write the negation of each of the following statements. 


(a) There is a real number x such that x7 +x +1=0. 

(b) Every real number is less than 100. 

(c) If fis a polynomial function, then fis continuous at 0. 

(d) If fis a polynomial function, then fis continuous everywhere. 

(e) V x, x real, J a real number y 3 y = x’, 

(f) There is a real-valued function f(x) such that f(x) is not continuous 
at any real number x. 


. Consider the following statement P: ‘“The square of an even integer is 


divisible by 4.” 

(a) Write P as a statement in the form, ‘for all...,if..., then... .” 

(b) Write the negation of P. 

(c) Prove P or =P. Explain any inductive reasoning you use to conjec- 
ture that P is true or that P is false. 


. Consider the following statement P: ‘“The sum of two even integers is 


divisible by 4.” 

(a) Write P as a statement in the form “for all... ,if...,then....” 

(b) Write the negation of P. 

(c) Prove P or =P. Explain any inductive reasoning you use to conjec- 
ture that P is true or that P is false. 


. In Example 14 the definition of a bounded function was given. 


(a) Write the negation of this definition; that is, complete the following 
statement: “‘A real-valued function f(x) is not bounded on the closed 
interval [a, b] if... .” 

(b) Give an example of a bounded function on (0, 1]. Justify your answer 
by determining a value for M. 

(c) Give an example of an unbounded function on [0, 1]. Justify your 

answer. 

Suppose the definition of bounded function were worded this way: 

“A real-valued function f(x) is said to be bounded on the closed 

interval [a, b] if for all x € [a, b], there exists a positive real number 

M such that |f(x)| Ss M.” Does this definition mean the same as the 

one given in Example 14? If not, explain how they differ. Could this 

new definition make sense as the definition of a bounded func- 
tion? Explain. 


(d 


wa 
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9. 


10. 


11. 


12. 


13. 


A real-valued function f(x) is said to be increasing on the closed interval 
(a, b] if for all x,, x. © [a, b], if x < x, then f(x) < f(%). 

(a) Write the negation of this definition. 

(b) Give an example of an increasing function on [0, 1]. 

(c) Give an example of a function that is not increasing on [0, 1]. 


(a) State a definition for a real-valued function f(x) to be decreasing on 
a closed interval [a, ]. 

(b) Give the negation of this definition. 

(c) Give an example of a decreasing function on (0, 1]. 

(d) Give an example of a function on [0, 1] that is neither increasing 
nor decreasing. 


Prove the following corollary of the Archimedean Principle. (See Example 
28 for the statement.) For every positive real number «, there exists a posi- 
tive integer N such that 1/n < « for all n = N. (Note: this exercise is the 
basis for the formal proof that the sequence {1, 5, 3, 4, - . } converges to 0.) 


Use the Archimedean Principle to prove the following: if x is a real 
number, then there exists a positive integer m such that -n <x <a. 


Prove that if x is a positive real number, then there exists a positive 


: 1 
integer n such that—~<x<n. 
n 


Discussion and Discovery Exercises 


D1. Consider the following question. What positive integers n can be written 


as the difference of two squares? For example, 5 = 3? — 2? and 24 = 
5° — 1°. The following table lists the expression n = x? — y’ for varying 
values of x and y. Since v is positive, we assume that x > y = 0. 


x] y v-y x y v-y x y y-y 
1 0 1 5 4 9 8 0 64 
2 0 4 6 0 36 8 1 63 
2 1 3 6 1 35 8 2 60 
3 0 9 6 2 32 8 3 55 
3 1 8 6 3 27 8 4 48 
3 2; 9) 6 4 20 8 5 39 
4 0 16 6 5 11 8 6 28 
4 1 15 7 0 49 8 i 15 
4 2 12 7 1 48 9 0 81 
4 3 7 7 2 45 9 1 80 
5 0 25 7 3 40 9 2 77 
5 1 24 7 4 33 9 3 72 
5 2 21 7 5 24 9 4 65 
5 3 16 7 6 13 9 5 56 


ll 


D2. 


D3. 


D4. 


DS. 


D6. 


D7. 


D8. 
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(a) Based on the table, conjecture a theorem that states exactly which 
positive integers can be written as the difference of two squares. 

(b) Try to prove your conjecture. 

(c) Notice that some integers appear more than once as the difference 
of two squares. What accounts for this? Is it possible that an integer 
will appear infinitely often on this list? Give a reason for your 
answer. 


Reread Example 1. Explain what part of that example involves inductive 
reasoning and what part involves deductive reasoning. See the introduc- 
tion to this chapter for a discussion of the difference between inductive 
and deductive reasoning. 


Use inductive reasoning to find a statement about whether or not the 
sum of two consecutive even integers is divisible by 4. Then prove 
your statement. 


Criticize the following “‘statement” of the Fundamental Theorem of 
Calculus: [’ f(x) dx = F(b) — F(a). 


Criticize the following ‘‘proof”’ of the fact that if nm and m are even then 
n + mis even. 


We know that n = 2tand m = 2t,son + m = 2t+ 2t = 4t. 
Therefore, n + m is even. 


Write out a correct proof. 


Is the following a valid proof that every integer multiple of 4 is even? 
If not, explain what you think is wrong with it and then write your own 
proof. 


Every multiple of 4 has a 4 in it and 4 has a 2 in it. 
Therefore, every multiple of 4 has a 2 in it and therefore a multiple of 
4 is even. 


Consider the following statement P: If 1 is an integer andn’is a multiple 
of 4, then n is a multiple of 4. The following is a “proof” that P is a 
false statement: 


6 = 36 and 36 is a multiple of 4 but 6 is not a multiple of 4. Therefore, 
P is false. 


Is this proof valid? Give your reasons and if you think it is not a valid 
proof, write a correct one. 


Consider the following “‘proof” of the fact that if m is an integer and 
n’ is even, then n is even. 
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By Example 1, the square of an even integer is even. Therefore n can’t 
be odd and hence must be even. 


Is this proof correct? Give reasons for your answer. If you think the 
proof is invalid, write a correct one. 


D9. Finda proof of the Pythagorean Theorem and write it out. What assump- 
tions are used in the proof? 


COMPOUND STATEMENTS 


Conjunctions and Disjunctions 


Statements, especially in mathematics, are often complicated but can be 
seen as built up from simpler statements. Such compound statements can 
be constructed in several different ways, at least two of which are very 
easy. 


Definition 1.2.1 Let P and Q be statements. 


1. The conjunction of P and Q, written P ~ Q (and read “‘P and 
Q”’), is the statement “Both P and Q are true.” 

2. The disjunction of P and Q, written P v Q (and read “P or Q”’), 
is the statement “‘P is true or Q is true.” 


Obviously, P «4 Q is true if P is true and Q is true. Notice, though, that 
there are three ways for P ~ Q to be false: if P is true but Q is false, if P is 
false but Q is true, and if both P and Q are false. 

The disjunction is sometimes referred to as ‘‘the inclusive or’’: it’s all 
right for both statements to be true. There is only one way for P v Q to be 
false: both P and Q must be false. That there are three ways for P v Q to 
be true is inherent in the definition. 

If P(x) and Q(x) are open sentences in the variable x, we can write 
P(x) A Q(x), read P(x) and Q(x), to mean the open sentence in the variable 
x that becomes the statement P(a) ~ Q(a) when x is assigned the value a. 

Similarly, P(x) v Q(x), read P(x) or Q(x), means the open sentence in 
the variable x that becomes the statement P(a) v O(a) when x is assigned 
the value a. 

The conjunction and disjunction of open sentences containing more than 
one variable are defined in like fashion. 


EXAMPLE | 


EXAMPLE 2 


EXAMPLE 3 


EXAMPLE 4 


EXAMPLE 5 
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P: Josef Stalin was the leader of the Soviet Union in 1940. 
Q: Germany invaded the Soviet Union in 1941. 


Here P and Q are both true, so both P ~ Q and P v Q are true as well. 


P: 71<5. 
QO: 3-2=1. 


Since P is false, P ~ Q is false, and since Q is true, P v Q is true. 


P: The Red Sox have won a World Series since World War I ended. 
Q: F. Scott Fitzgerald wrote ‘‘For Whom the Bell Tolls.” 


Both P and Q are false, so both P ~ Q and P v Q are also false. 


A statement or open sentence may be compound but not obviously so. For 
example, if x is a real number, then the open sentence |x| < 3 can be written 
without the absolute value sign as —3 < x < 3, and this sentence is the 
conjunction x < 3 and x > —3. 


The open sentence |x| > 3 can be written as the disjunction x > 3 or x < —3. 


Truth Tables 


Note that the preceding comments about the truth or falsity of P ~ Q and 
Pv Q apply to any statements substituted for P and Q. For example, if P 
and Q are any statements for which P is true and Q is false, then P A Q is 
false and P v Q is true. 

For this reason, it makes sense to consider expressions of the form 
P «Q, Pv Q, or =P where P and Q are variables representing unspecified 
statements. Such expressions are called statement forms. They are not really 
statements themselves but become statements when the variables P and Q 
are replaced by statements. 

We can summarize the comments made previously about the truth values 
of P ~ Q and P v Q by means of truth tables for their statement forms. 


P Q PAaQ P Q Pv@ 
T T T T T T 
T F F T F T 
F T F F T T 
F F F F F F 
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EXAMPLE 7 
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Note that the first two columns of each table list all of the different 
combinations of truth values for the variables P and Q. Each combination 
then determines a corresponding truth value of the statement form. Truth 
tables for other statement forms can be constructed in this way. 


Let P be a statement form. Then the truth table for —P is: 


If P and Q are statement forms, the truth table for P ~ —Q is: 


P Q Pr-O 
7 Tr F 
T F at 
F T F 
F F F 


Negating Conjunctions and Disjunctions 


Sometimes a compound statement formed from two or more statements can 
be expressed in different ways but mean the same thing. For example, con- 
sider the statement 


It is not true that today is sunny and warm. 


In order for the statement ‘“Today is sunny and warm” to be false, it is 
only necessary that one of the two conditions fail to happen; namely, it not 
be sunny or not be warm. Therefore, the statement “‘It is not true that today 
is sunny and warm’’ means the same as 


It is not sunny or it is not warm. 


If one is true then the other is true and vice versa. 

If we let P be the statement ‘“‘Today is sunny” and Q the statement 
“Today is warm,” then we are saying that if —(P A Q) is a true state- 
ment, then (4P) v (—=Q) is true and conversely if (=P) v (—=Q) is true, 
then —(P a Q) is true. 

In fact, for any statements P and Q, —(P « Q) and (—P) v (=Q) mean 
the same thing. This follows from the fact that the statement forms 
—=(P a Q) and (-P) v (=Q) have the same truth tables. 


EXAMPLE 8 
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P oO =P | -@ PAQ (PQ) =P v =O 
"i T F F T F F 
T F F ah F . T 
F A Ay F F 4 a 
F F T a F Tt © 


Logically Equivalent Statements 


Statements and statement forms like the preceding ones are said to be 
logically equivalent. Here is a formal definition. 


Definition 1.2.2 We say that two statements are logically equivalent or 
just equivalent if they are both true or both false. 

We say that two statement forms are logically equivalent if the substi- 
tution of statements for the variables in the forms always yields logically 
equivalent statements. 


Note: If two statement forms have the same truth tables then they are logically 
equivalent. 


The preceding truth table shows that the statement forms =(P « Q) and 
(AP) v (=Q) are logically equivalent. In other words, given any two State- 
ments P and Q, P A Q is false exactly when P or Q is false. 

Similarly, P v @Q is false exactly when both P and Q are false, so 
-(P v Q) is logically equivalent to ~P ~ —Q. A comparison of their truth 
tables will verify this fact as well. 


P Q aP =Q2 Pv@ -(P v Q) =P ~-=@Q@ 
T T F F T F F 
T F F T T F F 
F T T F T F F 
F F T T F T T 
P: The earth is flat. 
Q: The earth revolves around the sun. 
(P AQ): — The earth is flat and revolves around the sun. (False) 
=(P ~Q): — The earth is not flat or it does not revolve around the 
sun. (True) 
Pv@Q: — The earth is flat or it revolves around the sun. (True) 
(Pv Q): The earth is not flat and it does not revolve around the 


sun. (False) 
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In Section 1.1, we saw two other examples of logically equivalent state- 
ment forms: 


(a) =(Wx P(x)) is logically equivalent to dx 3 (—P(x)). 
(b) =(Ax 3 P(x)) is logically equivalent to Vx (=P(x)). 


If we combine these two examples with the logical equivalences associated 
with negating conjunctions and disjunctions, we obtain the following equiva- 
lent statement forms: 


(c) =(Wx (P(x) v Q(x))) is equivalent to dx 3 ((AP(x)) A (AQ(x))). 
(d) =(Vx (P(x) A Q(x))) is equivalent to dx 3 ((-=P(x)) v (AQ(x))). 
(e) A(Ax 3 (P(x) v Q(x))) is equivalent to Vx ((AP(x)) A (AQ(x))). 
(f) (Ax 3 (P(x) A Q(x))) is equivalent to Vx ((—P(x)) v (=Q(x))). 


These equivalences are illustrated by the following examples. 


P: — Every day this week was sunny or hot. 
=P: Some day this week was not sunny and not hot. 


P: There is a real number x such that x > 4 and x < 10. 
=P: If x is a real number, then x = 4 or x = 10. 


P: Every multiple of 6 is even and is not a multiple of 4. 
—P: There is a multiple of 6 that is odd or is a multiple of 4. 


Sometimes statements that appear to be logically equivalent are not. For 
example, suppose that P(x) and Q(x) are open sentences containing the 
variable x and let S and T be the following statements: 


S: For all x, P(x) or O(x). 
T: For all x, P(x) or for all x, Q(x). 


Both of these statements have universal quantifiers and involve a disjunc- 
tion with respect to the statements P(x) and Q(x). At first glance, they appear 
to be saying the same thing and therefore should be logically equivalent. 

But consider the following example: let P(x) be the open sentence ‘“‘x > 
2” and Q(x) the open sentence ‘“‘x < 5.” Letting x be assigned values from 
the real numbers, S and T become: 


S: For all real numbers x, x > 2 or x <5. 
T: For all real numbers x, x > 2 or for all real numbers x, x < 5. 


It is clear that S is a true statement but Tis a false statement. Therefore 
S and T are not logically equivalent. 
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As another example, consider Example 9. The statement ‘“‘Every day this 
week was sunny or hot” does not mean the same as “‘Every day this week 
was sunny or every day this week was hot,”’ because for the first statement 
to be true, every day of the week had to be sunny or hot, so some could have 
been sunny and cool and others cloudy but hot. But the second statement is 
only true if every day was sunny or if every day was hot. 

Now let’s consider the following statements, again assuming that P(x) 
and Q(x) are open sentences containing the variable x. 


S: There exists x such that P(x) or Q(x). 
T: | There exists x such that P(x) or there exists x such that Q(x). 


In this case, S and T are logically equivalent. To prove this we must show 
that S and 7 are either both true or both false. 

First, suppose that S is a true statement. Then there is an assigned value 
of x, say x = a, for which P(a) is a true statement or Q(a) is a true statement. 
Since there are two possibilities, we’ll look at each one separately. Suppose 
that P(a) is true. It follows that the statement 


There exists x such that P(x) 
is true and therefore the disjunction 
There exists x such that P(x) or there exists x such that Q(x) 


is true also. So statement T is true in this case. On the other hand, if O(a) 
is true, then the statement 


There exists x such that Q(x) 


is true, and thus T is true in this case also. 

Now suppose that T is true. Then there is an x such that P(x) or there 
is an x such that Q(x). If there is an x such that P(x), then there is an assigned 
value of x, say x = a, such that P(a) is true. It follows that there is an assigned 
value of x, namely x = a, such that P(a) or Q(a) is true. Thus there exists 
an x such that P(x) or Q(x), so S is true. 

On the other hand, if there is an x such that Q(x), then there is an 
assigned value of x, say x = b, such that Q(b) is true. Hence there is an 
assigned value of x, namely x = b, such that P(b) or Q(b) is true. So there 
is an x such that P(x) or Q(x) and again S is true. 

Therefore if T is true, then S is true. This proves that S and T are 
logically equivalent. 
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The statement 
There was a day this week that was sunny or hot 
is logically equivalent to the statement 


There was a day this week that was sunny or there was a day this week 
that was hot. 


The following statements are equivalent: 
P: There is a real number x such that x > 2 or there is a real number 


x such that x < 5. 
QO: There is a real number x such that x > 2 or x <5. 


Our examples of compound statements given in this section so far have 
been formed from two given statements or statement forms P and Q. We 
can, of course, construct statements from three or more given statements. 


Consider the statement 


According to today’s weather forecast, tomorrow will be cold and cloudy 
or cold and rainy. 


Let P, QO, and R be the statements: 


P: Tomorrow will be cold. 
Q: Tomorrow will be cloudy. 
R: Tomorrow will be rainy. 


Then the original statement has the form: 
According to today’s weather forecast, (P A Q) v (Pa R). 


If we analyze the meaning of this statement, then we see that it could 
be rephrased as 


According to today’s weather forecast, tomorrow will be cold and it will 
be cloudy or rainy. 


In this form our statement becomes 


According to today’s weather forecast, P ~ (Q v R). 


EXAMPLE |[5 
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The last example suggests that the statement forms (P 4 Q) v (Pa R) and 
P a (Q v R) are logically equivalent. To prove this we will examine their 
respective truth tables. Since there are three statements involved, the truth 
tables will have eight rows corresponding to all possible combinations of 
truth or falsity of P, Q, and R. 


P Q R PAQ PAR (P ~Q)v (PAR) 
T T T T T T 
T T F T F T 
T F T F T T 
T F F F F F 
F T T F F F 
F T F F F F 
F F T F F F 
F F F F F F 
P Q R OvR Pa(QvR) 

T T T T T 

T T F T T 

T F T T T 

T F F F F 

F T T T F 

F T F T F 

F F T T F 

F F F F F 


Since the truth tables of (P A Q) v (P a R) and P ~« (Q v R) are the 
same, they must be logically equivalent. 

We now give a mathematical example of the equivalence in the previ- 
ous example. 


Let x be a real number. Suppose that we wish to write the statement 
x > —10 and |x| > 5 


in a simpler form without using absolute value signs. The inequality |x| > 5 
is equivalent to the statement x > 5 or x < —5S. 
By the previous example, the statement 


x > —10 and (x > 5 or x < —5) 
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is equivalent to the statement 
(x > —10 and x > 5) or (x > —-10 and x < —S). 


Because any real number x that is greater than 5 is also greater than 
—10, the statement x > —10 and x > 5 is equivalent to x > 5. Finally, we 
can reword our statement as 


x >S5or-10<x< —-S. 


Tautologies and Contradictions 


Sometimes a statement form will always be true no matter what statements 
are substituted for the variables. Such a statement form is called a tautology. 
A simple example is the statement form P v —P. This statement form is true 
when P is true and it is true when P is false. A statement form is a tautology 
if each of its truth table values is true. 

A statement form that is always false is called a contradiction. The state- 
ment form P A —P is an example. A statement form is a contradiction if 
each of its truth table values is false. 

Note that if S is a tautology then —S is a contradiction and if S is a 
contradiction then —S is a tautology. We will see other examples of tautolo- 
gies and contradictions in this chapter. 


se HISTORICAL COMMENTS: EUCLID’'S AXIOMS 


In Example 28 of Section 1.1, the following statement was given: For every 
real number x, there is an integer n such that n > x. 

This is a true statement. It is in fact a property of the real numbers called 
the Archimedean Principle. An immediate consequence is that there is no 
“‘Jargest’’ real number. Although this fact may seem obvious from our intu- 
itive grasp of the real number line, a formal proof of the Archimedean 
Principle depends on an axiom of the real numbers called the Least Upper 
Bound Axiom. (A discussion of the Least Upper Bound Axiom and a proof 
of the Archimedean Principle is given in Section 7.2.) 

You will often find that statements that seem obvious require not so 
obvious proofs. The starting point of such proofs are axioms, statements that 
we assume as given. You probably first encountered axioms in your study 
of plane geometry in high school. And that of course brings us to Euclid. 

Euclid lived about 300 B.c. and although not much is known about his 
personal life, he is famous for his text E/ements. In it Euclid lists five postulates 
or axioms that form the basis of his study of geometry. In all, the Elements 
consists of 13 books and contains 467 propositions. The proof of each proposi- 
tion is based on earlier propositions and the postulates. The great achieve- 
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ment of Euclid’s Elements is the use of logic and deduction to advance the 
knowledge of mathematics. 

The Elements is not without its defects, however. Sometimes proofs are 
incomplete or assumptions are made without proof. For example, in one of 
his propositions Euclid implicitly uses without proof a statement equivalent to 
the Archimedean Principle. Archimedes (287-212 B.c.), perhaps the greatest 
mathematician of ancient Greece, used it himself without proof (hence the 
attachment of his name to it) and the Greek mathematician Eudoxus (ca. 
408-355 B.c.) used it also. 

These are the five postulates: 


A straight line can be drawn from any point to any point. 

A finite straight line can be produced continuously in a straight line. 
A circle may be described with any center and distance. 

All right angles are equal to one another. 

If a straight line falling on two straight lines makes the interior angles 
on the same side together less than two right angles, the two straight 
lines, if produced indefinitely, meet on that side on which the angles 
are together less than two right angles. 


Ca Gort. 


Euclid’s fifth postulate, known as the Parallel Postulate, is the most 
famous of all of the axioms. What strikes one about these postulates is that 
the first four seem quite straightforward and obvious. The fifth postulate, 
however, is of a different nature. It reads more like a theorem. One has to 
read it carefully in order to understand it. Even then it does not seem readily 
apparent. It seems more like a statement one should try to prove rather 
than accept as given. In fact many mathematicians from Euclid’s time and 
later felt the same way and numerous attempts were made to prove the fifth 
postulate. However, all attempts at a proof failed. Usually when someone 
claimed to have a proof of the fifth postulate, it would turn out that some- 
where in the proof, an assumption had been made that was in fact logically 
equivalent to the fifth postulate. One of the earliest attempts, for instance, 
was made by the Greek astronomer Claudius Ptolemy in about 150 4.p. But 
Ptolemy assumed the following statement: Given a point P in the plane and 
a line L that does not intersect P, there is one and only one line that passes 
through P and is parallel to L. This statement is actually equivalent to 
Euclid’s fifth postulate and is the version of the fifth postulate that actually 
appears in most high school geometry texts. 

The following statements, familiar to students of plane geometry, are 
also equivalent to the Parallel Postulate: 


1. There exist two triangles that are similar but not congruent. 

2. There exists a triangle the sum of whose angles is 180 degrees. 

3. Through any three points not lying on a straight line there is a 
circle. 
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Exercises 1.2 


10. 


1, 


. Let Pbea statement form. Prove that P and —(—P) are logically equiv- 


alent. 


. Let P and Q be statement forms. Write the truth tables for the following 


statement forms. 
(a) (4P) v Q. (b) =(P v Q). 
(c) A(AP) A Q)). — (d) (AP) a (4Q)) v @. 


. Prove that the statement forms —=((4P) v Q) and P a (=Q) are logi- 


cally equivalent. 


. Write the negation of the following statements. 


(a) August is a hot month and September is sometimes cool. 

(b) Every member of the baseball team is complaining and not hitting. 
(c) Some cars are comfortable and not expensive. 

(d) Math tests are long or difficult. 


. Write the negation of the following statements. 


(a) If x and y are real numbers such that xy = 0, then x = Oor y = 0. 
(b) For every integer x, x’ is odd and x* — 1 is divisible by 4. 

(c) Vn, n an integer, J an integer k such that n = 2k orn = 2k + 1. 
(d) da rational number r 3 1 <r <2. 

(e) A real number can be greater than 2 or less than 1. 

(f) Some functions are neither differentiable at 0 nor continuous at 0. 


. Let P be the statement “Every multiple of 6 is even and is not a multiple 


of 4’ of Example 11 in this section. 

(a) Write P in the form, ‘‘for all... , if... , then. ...’’ Use variables. 
(b) Write the negation of P. Use variables. 

(c) Prove P or —P. 


. Repeat Exercise 6 if P is the statement “If the product of two integers 


is even, then both of the integers are even.” 


. Let n,m € Z. Write the negation of the statement “‘Exactly one of the 


integers n or m is odd.” 


. Consider the following statement P: “If n is an odd integer, then there 


exists an integer x such that n = 4x + lorn = 4x + 3.” 
(a) Write the negation of P. 
(b) Prove P or —P. 


Let P and Q be statement forms. 
(a) Prove that P A Q is logically equivalent to Q a P. 
(b) Prove that P v Q is logically equivalent to Q v P. 


Let P(x) and Q(x) be open sentences containing the variable x. In each 
part of this problem, determine if the given statements S and T are 
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15. 
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logically equivalent. If they are, give a proof. If they are not, give an 
example of open sentences P(x) and Q(x) for which S and T are not 
logically equivalent. 
(a) S: Wx, (P(x) A Q(x). 

T: (Wx, P(x)) a (Wx, Q(x)). 
(b) S: Ax 3 (P(x) A Q(x)). 

T: (Ax 3 P(x)) A (Ax 3 Q(x)). 
Let P, QO, and R be statement forms. Write the truth tables for the 
following statement forms. 
(a) (P A -=Q) v (AR). (b) (AP) A (R A (=Q)). 
(Reminder: To do this problem, it is necessary to list all possible combinations 
of truth values for P, Q, and R. There are eight of them.) 


Let P, O, and R be statement forms. 

(a) Prove that (P v Q) v R and P v (Q v R) are equivalent state- 
ment forms. 

(b) Prove that (P ~ Q) A R and P ~ (Q ~v R) are equivalent state- 
ment forms. 

(c) Prove that P v (Q 4 R) and (P v Q) a (P v R) are equivalent 
statement forms. 


Characterize all real numbers x such that x > 1 or |x| < 3. Express 
your answer in the simplest possible way without using absolute value 
signs. 


Let P, O, R, and S be statement forms. 

(a) Prove that Pv ((Q A, R) A S)) and (Pv Q) a (Pv R) a (Pv S) are 
equivalent statement forms. 

(b) Prove that P ~ ((Q v R) v S)) and (Pa Q) v (PA R) v (P. 4 S) are 
equivalent statement forms. 

Let P and Q be statement forms. 

(a) Prove that (P AQ) v (Pa-=Q) v(APA~ Q) v (AP A-=Q) isa tautology. 

(b) Prove that (P v Q) a (P v -=Q) a (AP v Q) « (AP v -Q) is a 
contradiction. 


Discussion and Discovery Exercises 


D1. Consider the following ‘“‘proof” that if m or m is an odd integer, then 


nm Is an even integer. 


Suppose that n is odd and m is even. Then m = 2t for some integer 
t. Therefore nm = n(2t) = 2nt, which is even. Next suppose that n 
is even and m is odd. We can write n = 2s for some integer s. Thus 
nm = 2sm, which is also even. In both cases, we get that nm is even. 
Thus the statement is proved. 


Is the proof valid? If you think it is not, explain. 
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D2. 


D3. 


D4. 


D6. 


Consider the following two statements: 


P: For every real number x, x’ = 0. 
Q: Lyndon Johnson was elected president in 1964. 


Are these statements logically equivalent? Explain. 


Using the five statements below as clues, match Sarah, Ann, and Bob 
with their respective occupations (teacher, entomologist, or poet) and 
the color of their houses (brown, white, or green). Explain how you 
arrived at your answer and mention any rules of logic that you use. 
Assume that no two people have the same occupation or the same color 
house. 
The first three clues are true statements: 

1. Sarah or Ann is the poet. 

2. Sarah’s or Bob’s house is green. 

3. Ann’s house is green or white. 
The next two clues are false statements: 

4. The teacher’s house is green. 

5. Sarah is the poet or Bob is the entomologist. 


Consider the following two statements: 


P: _ For every even integer n, there is an odd integer m such that 
n + mis odd. 
Q: — There is an odd integer m such that for every even integer 


n,n + mis odd. 


(a) Do these statements mean the same thing? If not, explain the dif- 
ference. 

(b) Write the negations of P and Q. 

(c) Discuss the truth or falsity of statements P and Q. Give proofs. 

(d) Are these statements logically equivalent? Give reasons for your 
answer. 


. Repeat the previous problem for the following statements: 


P: For every real number x between 0 and 1, there is a real 
number y between 1 and 2 such that x + y < 2. 
Q: — There is areal number y between 1 and 2 such that for every 


real number x between 0 and 1,x + y < 2. 


Euclid’s fourth postulate says: ‘‘All right angles are equal to one an- 
other.” Isn’t it obvious that all right angles are equal? What do you 
think Euclid meant by right angle? Why does he consider it necessary 
to include this postulate? 
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IMPLICATIONS 


Implication: Definition and Examples 


In mathematics, as we have noted before, we are interested in whether a 
given statement is true or false. Usually such a statement will be proven true 
(or false) because it (or its negation) can be seen to follow logically from 
prior statements that we know to be true. Those prior statements may be 
propositions already proven or they may be axioms. Axioms are statements 
that we take to be given without proof and serve as the starting point of a 
particular subject. In Chapter 5, for example, we will list a number of axioms 
of the integers and then deduce properties of the integers from them. 

In the previous two sections, many of the statements we looked at took 
the form: “If..., then...’ or “For all...,if...,then....’ The “‘if-then”’ 
part of such a statement is called an implication. Most theorems will take 
this form. In Example 1 of Section 1.1, we considered the statement: “If n 
is an even integer, then n’ is even.” The “‘if”’ part of the statement gives the 
premise or assumption that is made. In this example, we assume that n is 
an even integer. The “‘then”’ part is the conclusion that is asserted to follow 
from the premise. The statement ‘‘n’ is even” is asserted to be a true statement 
under the assumption that ‘‘n is even.” So in order to prove such a statement 
as “if n is an even integer, then n’ is even,” one starts, as was done in 
Example 1 of Section 1.1, by assuming that n is an even integer and then 
proves that n’ is even. 

We begin with a formal definition of implication and then do some ex- 
amples. 


Definition 1.3.1 Let P and Q be statements. The implication P > Q 
(read ‘‘P implies Q’’) is the statement “If P is true, then Q is true.” 


P; 3 +2 = 5, 
QO: 3+1+4+1=5. 
P>Q: If3+2=5,then3+1+1=5S. 
P: 4 is an even integer. 
Q: 4 is an even integer. 
P>@Q: If 4 is an even integer, then 4’ is an even integer. 
P: — The function f(x) = x? is differentiable at 0. 
Q: The function f(x) = x? is continuous at 0. 
P=>@Q: | If the function f(x) = x? is differentiable at 0, then it is 


continuous at 0. 
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Truth Table for an Implication 


Before we consider when an implication might be true, an important distinc- 
tion must be made. For a mathematician, there is no sense of causality in 
the statement P > Q; we leave this question to philosophers. P might be 
(apparently) entirely unrelated to Q. 

Rather, the statement simply means that in all circumstances under 
which P is true, Q is also true. Very loosely, whenever P “‘happens,” Q also 
‘“‘*happens’’; we don’t care whether P seems to cause Q or not. 

With this in mind, when would P => Q be false? We would need P to 
“happen” (P true) and Q not to “‘happen”’ (Q false). This is the only case. 
Seen another way, P => Q can’t be false (so must be true) if P is false. Even 
if P and Q are both false the implication P > Q is true. If P is true, though, 
Q must also be true for P > Q to be true. 

In summary, whether P => Q is true or false depends only on the truth 
or falsity of P and of Q, so there are four cases, just as with conjunction and 
disjunction. The following table gives the truth values of the statement form 
P>@Q. 


P i) P=0O 
T T T 
T F F 
F T T 
F F T 
P: 3+2=5 


In this case, both P and Q are true, so P > Qis true as well. Intuitively, 
P ‘“‘forces” Q to be true, since 2 = 1 + 1, but the truth of the two statements 
is sufficient. 


P: Gerald Ford was vice president under Jimmy Carter. 
Q: 2<7. 


Here, P is false and Q is true, so P > Q is true. Notice that if R is the 
statement 7 < 2, P = R is also true; a false statement implies anything! 


P: The American Revolution ended in 1781. 
Q: George Washington served three terms as president. 


P is true and Q is false; however related the statements might be, the 
statement P => Q is false. 


EXAMPLE 7 


EXAMPLE 8 


EXAMPLE 9 


1.3 IMPLICATIONS 31 


P: 41s an even integer. 
Q: 4 is an even integer. 


Since P and Q are both true, P > Q is true. Example 1 of Section 1.1 
does imply that there is a causality between statements P and Q but the 
mere fact that P and Q are both true statements is sufficient to establish 
that P > Q is true. 


P: — The function f(x) = x’ is differentiable at 0. 
Q: The function f(x) = x? is continuous at 0. 


Both P and Q are true statements verified in calculus. So the implication 
P = Q is true. There is a causality between P and Q, since a theorem 
from calculus shows that differentiability implies continuity. But again this 
causality, although it is certainly helpful to use, is not needed to establish 
the truth of P> Q. 


P: The function f(x) = x is continuous at 0. 
Q: The function f(x) = x’ is differentiable at 0. 


Here there is no causality between P and Q since continuity does not in 
general imply differentiability. Nevertheless, the implication P > Q is true 
since P and Q are both true. 


Proving Statements Containing Implications 


Most often we will be interested in establishing the truth of (proving) state- 
ments of the form Vx, P(x) = Q(x) where P(x) and Q(x) are open sentences. 

Since P(x) and Q(x) are not statements, the expression P(x) > Q(x) is 
not a statement either. P(x) > Q(x) is the open sentence in the variable x 
that becomes the statement P(a) = Q(a) when x is assigned the value a. 
But recall that the expression Vx, P(x) > Q(x) is a statement because the 
variable x has been quantified. 

(For simplicity of notation, we assume there is only one variable involved 
but in fact the same principles apply no matter how many variables are 
being used.) 

Since, for an assigned value a of x, the statement P(a) > Q(a) will always 
be true if P(a) is false, we need not consider this case. Rather, we can assume 
that, if the variable x is assigned a value a, then P(a) is true and proceed 
from there to prove Q(a). P(a) is called the hypothesis, and Q(a) the conclu- 
sion. We use a letter, in this case a, to denote the assigned value of the 
variable x rather than give a specific value to x like 0 or 2, since we must 
prove that P(a) = Q(a) for every possible assigned value of x. Here are 
some examples. 
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The statement ‘“‘The square of every even integer is even” can be written 
“Tf n is an even integer, then n’ is even.” This statement contains a universal 
quantifier. Recall from Example 6 of Section 1.1 that the sentence can be 
reworded ‘For every integer n, if n is even, then n’ is even.” This statement 
has the form V integers n, P(n) > Q(n) where P(n) is the open sentence 
“nn is even” and Q(n) is the open sentence ‘“‘n’ is even.” 

The proof of this statement would start as follows: Let a be an integer 
and suppose a is even. The statement “‘a is even” is the hypothesis. We are 
thus starting out by asserting the truth of the hypothesis P(a). The conclusion 
is the statement “‘a* is even.”” The remainder of the proof involves verifying 
that the statement Q(a) is now true. The details of this proof were done in 
Example 1 of Section 1.1. 

When we start this proof by saying ‘‘Let a be an integer and suppose a 
is even,” we are assigning the variable n an even integer value but without 
specifying exactly what that value is. The letter a thus stands for any even 
integer and once it is proven that a’ is even, the truth of the statement 
P(a) > Q(a) is established for every integer a. 


Note: In practice, it is usually simpler to use the same letter for the variable and its 
assigned value. So we could use just the letter in the proof of Example 1@ without 
confusion. But remember, once we say “‘Let n be an integer,” then P(n) is a statement. 


Prove or disprove the statement S: “If n and m are odd integers, then n + 
m is even.” 
This statement can be reworded 


For all integers n and m, P > Q 


where P is the open sentence ‘“‘n and m are odd integers” and Q is the open 
sentence: ‘“‘n + m is even.” 

Note that statement S contains two variables. We should more properly 
write P(n, m) and Q(n, m), since P and Q depend on the variables n and 
m, but for the sake of simplicity of notation, we will simply write P and Q. 

Because the truth or falsity of S is not immediately clear, we might begin 
by making a conjecture as to whether or not it is true. A good way to start 
is to try a few examples. Our hypothesis is that n and m are odd integers, 
so we'll try some odd integers and see if their sums are even or odd: 


3+5=8, 11 + 17 = 28, 59 + 43 = 102. 


In each of these cases, the sum is even. These examples do not prove 
that the statement is true since the statement contains a universal quantifier. 
We must prove that for all odd integers n and m, the sum n + m is even. 
But it certainly seems reasonable to conjecture that S is true. 

Now let’s prove it! 

To begin our proof, let n and m be odd integers. This means that we can 
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write n as 2t + 1 and mas 2s + 1, where ¢ and s are integers. Note that if 
we write n = 2t + 1, then we cannot write m = 2t + 1 since n and m are 
not necessarily the same integer. We must use another letter besides t and 
so we arbitrarily chose the letter s. Thus 


n+m = (2t+ 1) + (25+ 1) = 2t+25+2 = 2t+5 +1) 


and since t + s + 1is an integer, m + mis even. The proof is complete. 


Note that we started our proof by letting m and m be odd integers. The 
variables n and m stand for any pair of odd integers, not specific ones. So 
the fact that n + m is even has been established for all pairs of odd integers 
n and m. This method of proof is necessary since the statement has a univer- 
sal quantifier. 


Negating an Implication: Counterexamples 


Sometimes a statement containing an implication may be false. In that case, 
it becomes necessary to state its negation in a coherent fashion in order to 
prove that it is false. 


Consider the statement “If a real-valued function fis continuous at 0, then 
f is differentiable at 0.” This statement from calculus is false. To see how 
to write its negation, we break it down into its simple components. Let P(f) 
be the open sentence ‘“‘fis continuous at 0” and Q(f) the open sentence “‘f 
is differentiable at 0.” (Note that the variable in this example is f.) So our 
statement is “For every real-valued function f, P(f) > Q(f).” As we saw 
in Section 1.1, the negation must be the statement ‘There is some real- 
valued function f such that =(P(f) = Q(/)).” 

As we have seen, if P and Q are statements, then the only way for P > 
Q to be false is that P be true and Q be false. Thus the negation must take 
the form ‘‘There is an assigned value of the variable f, say f = g, such that 
P(g) is true and Q(g) is false.” More simply, using just the letter f, “There 
is a real-valued function f that is continuous at 0 but not differentiable at 
0.’’ Since the absolute value function f defined by f(x) = |x| is such a function, 
the proof is complete. This function is called a counterexample because it 
serves to disprove a statement with a universal quantifier. 


Given the previous example, it should not be surprising that if P and Q 
are statement forms, then =(P => Q) is logically equivalent to P ~ AQ. 
We leave it as an exercise to show that these statements have the same 
truth tables. 

It follows then that if P(x) and Q(x) are open sentences, the negation of 
the statement ‘For all x, P(x) > Q(x)” is the statement ‘“‘There exists x such 
that P(x) ~ =Q(x),” or in other words, ‘For some x, P(x) is true and Q(x) 
is false.” 
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The value assigned to the variable x that makes P(x) true and Q(x) false 
is called a counterexample to the statement “For all x, P(x) > Q(x).” 
Note that the negation of an implication is not an implication! 


Consider the statement ‘‘The sum of two perfect squares is a perfect square.” 
To prove or disprove this statement, let’s first write it as a statement that 
has a universal quantifier and contains an implication: “‘For every pair of 
integers n and m, if n and m are perfect squares, then n + m is a perfect 
square.” Symbolically, we can write this as “W integers n and m, P > Q” 
where P is the open sentence ‘‘n and m are perfect squares” and Q is the 
open sentence ‘‘n + m is a perfect square.”’ The negation of this statement 
would be “There exist integers n and m such that P ~ —=Q.” In other words, 
“There exist integers n and m such that n and m are perfect squares but 
n + mis not a perfect square.” 

Since 4 and 9 are perfect squares, but 4 + 9 = 13s not a perfect square, 
the negation must be true and the original statement false. Again we have 
used a counterexample to disprove a statement with a universal quantifier. 


Necessary and Sufficient Conditions 


Given statements P and Q, the implication P => Q means, as we have seen, 
that if P is true, then Q is true. We say then that P is a sufficient condition 
for Q. In other words, in order for Q to be true, it is sufficient that P be true. 

Also, if P > Q is a true statement, we say that Q is a necessary condition 
for P, meaning that Q must be true in order for P to be true. In other words, 
if Q is false, then P is false. The statement —Q => —P is logically equivalent 
to P = Q. This equivalence will be discussed in the next section. 

Note however that if P > Q is a true statement, then it is not necessary 
that P be true in order for Q to be true. Similarly, Q is not a sufficient 
condition for P. In other words, even if Q is true, P may be false. 


Let x be areal number. Let P be the statement “‘x > 5’’ and Q the statement 
“x > 0.” It is clear from properties of inequalities of real numbers that 
P = Q is true. So x > 5 is a sufficient condition for x > 0. But it is not a 
necessary condition. x need not be greater than 5 for x to be greater than 
0. On the other hand, x > 0 is a necessary condition for x > 5; that is, if 
x > Ois false, then x > 5 is false. We can also easily see that x > 0 is not 
a sufficient condition for x > 5. 


Let f be a real-valued function. It follows from Example 12 that f being 
continuous at 0 is a necessary but not sufficient condition for f to be differen- 
tiable at 0. On the other hand, f being differentiable at 0 is a sufficient 
but not necessary condition for f to be continuous at 0. 
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#e HISTORICAL COMMENTS: LOBACHEVSKIAN GEOMETRY 


In Section 1.2 we discussed the attempts to prove Euclid’s fifth postulate and 
how all such attempts failed. Then, in the 19th century, three mathematicians, 
Nicolai Lobachevsky (1793-1856), Johann Bolyai (1802-1860), and Carl 
Friedrich Gauss (1777-1855) arrived at the same conclusion independently: 
that Euclid’s fifth postulate was indeed an independent axiom and could not 
be deduced from the other axioms. This assertion followed from the realiza- 
tion that by altering Euclid’s fifth postulate, a new consistent geometry could 
be created. Their revised postulate was the following: given a point P and 
a line L that does not contain P, there is more than one line that passes 
through P and is parallel to L (that is, does not intersect L). From this 
postulate, together with the other Euclidean axioms, a new non-Euclidean 
geometry was born. This geometry has come to be known as Lobachev- 
skian geometry. 

Lobachevskian geometry has some interesting and surprising theorems. 
In this geometry, the sum of the angles of any triangle is /ess than 180 degrees 
and if two triangles are similar, then they must be congruent. 

A model for Lobachevskian geometry is the interior of a circle where 
“lines” are defined to be chords of the circle with the endpoints excluded. 
In the accompanying figure, we see that we can draw infinitely many lines 
through the point P that do not intersect the line L. 


Qe 


ee 4 


Exercises 1.3 


1. Express the following statements in the form “‘for all... , if... then 
... using symbols to represent variables. Then write their negations, 
again using symbols. 

(a) Every hexagon has 6 sides. 

(b) An integer is odd or even. 

(c) A function that is differentiable at 0 is continuous at 0. 
(d) All positive real numbers have a square root. 


2. Repeat Exercise 1 for the following statements. 
(a) All angles of an equilateral triangle are equal. 
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12. 


13. 


14. 


(b) 1 is the smallest positive integer. 
(c) When the product of two integers is even, then both integers are even. 
(d) Between any two real numbers there is a rational number. 


. Let P and QO be statements. 


(a) Prove that —(P => Q) is logically equivalent to P ~ =Q. 

(b) Prove that =(P > Q) is not logically equivalent to “P > —Q. 

(c) Give an example of statements P and Q such that 4P > —Q is true 
and —(P => Q) is false. 

(d) Explain why it is impossible to give an example of statements P and 
Q such that =(P > Q) is true and —P => —(Q is false. 


. Consider the following statement: ‘““The product of an even integer with 


any integer is always even.” 

(a) Rewrite the statement in the form ‘‘for all... , if... , then... 
using symbols to represent variables. 

(b) Write the negation of the statement, again using symbols. 

(c) Prove the statement if you think it is true or disprove it if you think 
it is false. 


” 


. Repeat Exercise 4 for the statement “‘The cube of an odd integer is odd.”’ 


. Repeat Exercise 4 for the statement “‘The sum of the squares of two 


consecutive integers is odd.” 


. Repeat Exercise 4 for the statement ‘““The sum of the squares of three 


consecutive integers is even.” 


. Repeat Exercise 4 for the statement: “If the square root of a positive 


integer is an even integer, then the integer itself is even.” 


. Prove that the statement form (P => Q) v (Q = P) is a tautology. 


. Let P, OQ, and R be statements. Prove that if the statement P > Q is 


true and the statement QO = R is true, then the statement P = R is true. 


. Let P(x), Q(x), and R(x) be open sentences containing the variable x. 


Prove that if the statement (Vx(P(x) > Q(x))) a (Wx(Q(x) = R(x))) is 
true, then the statement Vx(P(x) => R(x)) is true. 


Let P, QO, and R be statement forms. Prove that the statement form 
P= (Qv Rk) is logically equivalent to the statement form (P A =Q) => R. 


Let P, QO, and R be statement forms. Is the statement form P > (Q v 
R) logically equivalent to the statement form (P > Q) v (P = R)? 
Justify your answer. 


Let P, Q, and R be statement forms. Prove that the statement form 
P => (Q~ R) is logically equivalent to the statement form (P > Q) a 
(P = R). 


15. 


16. 


17. 


18. 
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Let x and y be real numbers. Consider the statement S: “If xy = 0, then 

x =Oory =0.” 

(a) Show that S can be expressed in the form P > (Q v R) for appropriate 
statements P, Q, and R. 

(b) Rewrite statement S using its logically equivalent form (P A =Q) > 
R. (See Exercise 12.) 

(c) Using the revised form of statement S from part (b) and some 
properties of the real numbers, prove statement S. 


Let P(x), Q(x), and R(x) be open sentences containing the variable x. 
If the statement (4 x 3 (P(x) => Q(x))) A (A x 3 (Q(x) > R(x))) is true, 
does it follow that the statement 4 x 3 (P(x) => R(x)) is true also? If 
you think it does, give a proof. If you think it does not, give an example 
of open sentences P(x), Q(x), and R(x) for which the statement 
(A x 3 (P(x) > Q(x))) A A x 3 (Q(x) => R(x))) is true and the state- 
ment J x 3 (P(x) => R(x)) is false. 


Let n be an integer. Let P be the statement ‘‘n is a multiple of 4” and 
Q be the statement ‘“‘n? is a multiple of 4.’ Which of the following are 
true statements? Prove your answers. 

(a) P is a sufficient condition for Q. 

(b) P is a necessary condition for Q. 

(c) Qisa sufficient condition for P. 

(d) Q is a necessary condition for P. 


Let n, m € Z. Repeat Exercise 17 where P is the statement ‘‘n and m 
are odd integers” and Q is the statement ‘“‘n + mis even.” See Example 
11 of this section. 


Discussion and Discovery Exercises 


D1. 


D2. 


D3. 


In Example 11, the phrase “‘we arbitrarily chose the letter s’’ is used. 
Write a short explanation of what the word “arbitrarily’’ means in 
this context. 


Write out the truth table for —P = —@Q and show that it is logically 
equivalent to P v —Q. Write out an informal explanation of why this 
equivalence makes sense. 


(a) Prove that if n is a positive integer, then 1 + 24+ 27+ 27>+...+ 
2h Det 1, 

(b) Suppose that you place a piece of paper on a table, then place 2 
pieces on top of it, then 4 pieces on top of that, then 8 pieces, then 
16 pieces, and so on, doubling the number of pieces each time. 
Assuming that you do this doubling 50 times, how many pieces of 
paper are in the pile? 
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(c) Assuming that each piece is 1/200th of an inch thick, estimate how 
tall the pile is. 

(d) Use a calculator to give a good approximation of the size of the 
pile. Express your answer in inches, feet, or miles, whatever seems 
most appropriate. How good was your estimate? 

(e) How does this problem fit the theme of this chapter? Explain. 


D4. The ancient Egyptians were able to find a formula for the volume of a 
truncated pyramid with square base and square top. Although we don’t 
know how they arrived at the formula, it is an ingenious application of 
inductive reasoning and perhaps of deductive reasoning as well, if in 
fact they were able to prove their result. If the base has side b, the top 
has side a, and the vertical height is h, find the formula for the volume 
V of the truncated pyramid. Justify your answer. (Recall that the volume 
of a full pyramid is one-third the area of the base times the height.) 


CONTRAPOSITIVE AND CONVERSE 


Contrapositive 


An alternative way to prove statements of the form P > Q is to verify the 
statement —Q => —P. This would mean showing that whenever Q is false 
then P is false. Then it would follow that if P is true, Q could not be false 
(because Q false implies P false) and thus Q is true. Another way to see 
this is to note that the statements P > Q and =Q = -—P are logically 
equivalent. We leave it as an exercise to verify that they have the same 
truth tables. 

Besides the formalism of showing that the truth tables are the same, it 
is important to understand in an intuitive way why these statements are 
logically equivalent. It should make sense that they mean the same thing. 
Let’s take a nonmathematical example. 


Let P be the statement “‘John lives in Boston” and Q be the statement ‘John 
lives in Massachusetts.” The implication P > Q is the statement “If John 
lives in Boston, then John lives in Massachusetts.”’ This is a true statement. 
(We'll assume we mean Boston, Massachusetts.) The implication ~Q => —P 
is the statement “If John does not live in Massachusetts, then John does not 
live in Boston.”’ It means the same as P > Q. 


Definition 1.4.1 The statement —Q = —P is called the contrapositive 
of the statement P => Q. 
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Implication: If it rained today, then the game was cancelled. 
Contrapositive: If the game was not cancelled, then it did not rain 
today. 


Implication: Ifx +1> 5, thenx > 4. 
Contrapositive: Ifx =4,thenx +155. 


The following example illustrates why it is often better to prove an 
implication by proving its contrapositive. 


Prove that if n is an integer and n’ is even, then n is an even integer. 

We'll start by trying to prove this statement directly. Let n be an integer 
such that n? is even. We wish to show that n is even. Because we are assuming 
that n’? is even, we can write n’ as twice an integer. This means we can write 
n’ = 2t for some integer ¢. 

Because we want to show that n is twice an integer, the obvious thing 
to do is to take the square root of both sides of this equation. This gives 
n = V2t. What conclusion can we draw from this equation? Unfortunately, 
not much at all because the square root of an integer is not necessarily an 
integer itself. Thus this approach to proving our statement has hit a dead end. 

Now we’ll try proving the contrapositive of our statement instead. The 
first step is to write the contrapositive in a coherent fashion. 

The statement we are trying to prove is an implication P > Q where, 
under the assumption that 1 is an integer, P is the statement ‘‘n’ is even” 
and Q is the statement ‘‘n is even.’ Thus —P is the statement ‘‘n’ is not 
even” or ‘“‘n* is odd” and —Q the statement ‘‘n is not even” or ‘“‘n is odd.” 

Therefore, the contrapositive of P => Q is the statement “If n is odd, 
then n’ is odd.” It is this statement that we will now try to prove. 

If n is odd, then we can write n = 2t + 1 for some integer ¢. Squaring 
both sides gives us 


m= (2t4+ 1) = 4 + 4¢4+ 1 = 2(2P + 21) +1 


and because 27? + 2r is an integer, it follows that n’ is odd. The proof of the 
contrapositive is now complete. 

In proving —Q => —P, we also have proved P => Q, because these two 
statements are logically equivalent. We have now completed the proof of 
the statement: “If n is an integer and n’ is even, then n is even.” 


Converse 


Recall that in Example 1 of Section 1.1, we proved that if 1 is even, then n’ 
is even. Now in this last example we have proven that if n’ is even, then n 
is even. In symbolic language, if P is the statement ‘‘n is even” and Q is the 
statement ‘‘n’ is even’ (the reverse of the notation used in the previous 
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example), then Example 1 of Section 1.1 proves that P = Q and Example 
2 above proves that QO => P. 


Definition 1.4.2 The statement Q = P is called the converse of the 
statement P > Q. 


Implication: If it rained today, then the game was cancelled. 
Converse: _ If the game was cancelled, then it rained today. 


Implication: Ifx+1> 5, then x > 4. 
Converse: Ifx >4,thenx +1>5. 


It is important to note that an implication and its converse are NOT 
necessarily logically equivalent. The truth of one does not imply the truth 
of the other. In Example 5, even if the implication is true, the converse could 
be false. The game could be cancelled for a different reason. On the other 
hand, the statement “If 1 is even, then n? is even” and its converse “If n? is 
even, then n is even” are both true. However, one does not follow logically 
from the other. Two different mathematical proofs are required. 


In Example 11 of Section 1.3, we proved that if n and m are odd integers, 
then n + mis even. The converse of this statement is: if n and m are integers 
and n + mis even, then n and m are odd. But this converse is false because 
4 + 2 is even but neither 4 nor 2 is odd. 


Biconditional 


When a statement P => Q and its converse Q = P are both true, then P 
and Q are either both true or both false. Hence P and Q are logically 
equivalent. Using the terminology introduced at the end of Section 1.3, we 
can say that P is a necessary and sufficient condition for Q. 


Definition 1.4.3 Let P and Q be statements. The statement P = Q (or 
P iff Q, read P if and only if Q) is the statement (P = Q) a (Q => P). 
The symbol <¢ is called the biconditional. 


Combining Example 1 of Section 1.1 and Example 4 in this section, we 
have the following result, which we label as a theorem. 


Theorem 1.4.4 


Theorem 1.4.5 


Theorem 1.4.6 


Theorem 1.4.7 
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Let n be an integer. Then n is even if and only if 1? is even. 
This theorem may also be worded this way: 
Let n be an integer. Then the following are equivalent statements: 


1. n is even. 
2. n? is even. 


Notice that proving an equivalence requires two proofs! 


Sometimes a mathematical statement or proposition may seem like a 
new result when in fact it merely follows logically from an earlier proposition 
or theorem. For example, if P and Q are statement forms, then —P © —=Q 
is logically equivalent to P < Q. The proof of this fact can easily be seen 
by comparing their truth tables. The details are left as an exercise. The 
following theorem is a consequence of this equivalence. 


Let 1 be an integer. Then n is odd if and only if n? is odd. 


PROOF: Given an integer n, let P be the statement “‘n is odd” and Q be the 
statement ‘“‘n? is odd.”’ The theorem has the form Vn, P © Q. Because P& 
Q is logically equivalent to —P © —Q, it suffices to prove Wn, =P @& —Q. 
But =P = —Q is the statement ‘“‘n is even if and only if n’ is even,” proved 
in Theorem 1.4.4. The proof is now complete. * 


Note: The symbol «. used at the end of the last proof will be used throughout 
the text to signify that a proof has been completed. 


Proof by Contradiction 


Recall that in order to prove an implication P > Q, we can prove its 
contrapositive —Q = —P instead. Since P is our hypothesis or premise and 
Q is the conclusion that we wish to deduce from P, in effect what we are 
doing is assuming that our conclusion Q is false and proving that our premise 
P is false, thus getting a contradiction. The contradiction is that P and —P 
cannot both be true. This approach is often called proof by contradiction. 
It can work in other ways as well. In order to prove a statement S, for 
example, we could assume that S is false and deduce a statement that we 
know is false; a statement like ‘0 = 1” or “‘the square of a real number is 
negative.” The justification of this method of proof is given by the following 
theorem, followed by two examples. 


Let S be a statement and let C be a false statement. Then the statement 
aS = C is logically equivalent to S. 


PROOF: If S is true, then —S is false and therefore the implication —S > C 
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is true. On the other hand, if S is false, then —S is true and since C is false, 
the implication —S = C is false. Thus the statements S and —S => C are 
either both true or both false. Hence they are logically equivalent. «* 


Prove that there are no integers x and y such that x? = 4y + 2. 

To prove this by contradiction, we let S be the statement ‘‘There are no 
integers x and y such that x? = 4y + 2.” This statement can be reworded 
“For all integers x and y, x* 4 4y + 2.” 

Now we assume that S is false; then there exist integers x and y such 
that x? = 4y + 2. 

Since x? = 2(2y + 1), x? is even. Hence by Theorem 1.4.4, x is even also. 
So we can write x = 2n for some integer n. Therefore x? = 4n? = 4y + 2. 

Subtracting and factoring, we get: 4(n’? — y) = 2 or n’® — y = 3. Because 
n’ and y are integers, their difference must be an integer also. Now we 
have a contradiction, namely the statement that 3 is an integer. Since the 
assumption that S is false leads to a false statement, Theorem 1.4.6 implies 
that S must be true. (Note: the fact that 3 is not an integer can be deduced 
from the axioms for the integers discussed in Chapter 5.) Our proof by 
contradiction is now complete. 


We will prove by contradiction that V2 is not a rational number. The asser- 
tion that V2 is not a rational number can be expressed: if r is a rational 
number, then r’? # 2. Suppose, on the contrary, that V2 is rational. Then 
there is a rational number r such that r? = 2. Thus we can write V2 = a/b 
where a and 5D are integers and a/b is reduced to lowest terms. This means 
that a and b have no common factors other than 1 and —1. (Note that we 
are assuming a fact about rational numbers, namely that any rational number 
can be expressed in such a reduced form.) Squaring both sides of the above 
equation and multiplying through by b? gives a? = 2b’. It follows that a’ is 
even and so by Theorem 1.4.4, a is even. So a = 2c, for some integer c. We 
get a’? = 4c’ = 2b’ or 2c’ = b’, which implies that b’ is even and thus, again 
by Theorem 1.4.4, that b is even. This is a contradiction, since we assumed 
that a/b was reduced to lowest terms, and therefore a and b can have no 
common factors other than 1. Thus our assumption that V2 is a rational 
number must be false and we can conclude that V2 is not a rational number. 


“~ HISTORICAL COMMENTS: RIEMANNIAN GEOMETRY 


In proposing an alternative to Euclid’s fifth postulate, Lobachevski, Bolyai, 
and Gauss assumed that given a point P and a line L that does not intersect 
P, there is more than one line that passes through P and is parallel to L. 
Now suppose that the Parallel Postulate were replaced by the following 
axiom: given a point P and a line L that does not intersect P, there is no 
line that passes through P and is parallel to L. 

Mathematicians like Lobachevski and Bolyai believed that this revision 
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of the Parallel Postulate was not valid because they could derive a contradic- 
tion from it. But in doing so they assumed that Euclid’s second postulate 
“‘a finite straight line can be produced continuously in a straight line” meant 
that a line has infinite length. In fact, Euclid’s second postulate only implies 
that a straight line has no boundary. It does not mean that a line must be 
of infinite length. 

In 1854, in a celebrated lecture given to the faculty at the University of 
G6ttingen in Gemany, Bernhard Riemann (1826-1866) proposed an ambi- 
tious program of study that led to a revolution in geometry. As one facet 
of this program, Riemann pointed out the distinction between a line being 
without boundary and a line having infinite length. By allowing a “‘line”’ to 
have no boundary but at the same time have finite length, Riemann was 
able to describe another non-Euclidean geometry, different from Loba- 
chevski’s. In this geometry, Euclid’s fifth postulate is replaced by the axiom 
previously mentioned: through a given point P not on a line L there is no 
line that is parallel to L. In other words, all lines intersect! 

In what has come to be called Riemannian geometry, it can be shown 
that if perpendicular lines are drawn on the same side of a given line L, 
then these lines will all intersect in a common point Q and the lengths of 
these perpendiculars from Q to L will all have the same length a. Moreover, 
all lines in this geometry have finite length 4a. 

One model of Riemannian geometry is the surface of a sphere. On 
a sphere the shortest distance between two points is the arc of a great 
circle. By a great circle we mean a circle whose center is the center of 
the sphere. The great circles then are the “‘lines” of the geometry of the 
sphere. Note that these lines are finite in length, their lengths equal the 
circumference of the sphere, but have no boundary; that is, they have 
no beginning or end. 


Note also that the larger the sphere is, the larger are the lengths of the 
lines in this geometry. At the same time, the larger the sphere the more the 
geometry resembles Euclidean geometry. Imagine the surface of the earth 
as a sphere, even though it is not exactly that shape. To observers on the 
earth, the surface appears flat and straight lines seem to extend indefinitely 
and be infinite in length. But we know that any line drawn on the earth will 
eventually circle the earth and come back on itself. By contrast, if the spheri- 
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cal surface is a ball of radius 3 inches, then any line on the ball more than 
an inch long will definitely look curved. The difference between the surface 
of the earth and the ball is “curvature.” Curvature is a mathematical term 
that measures how much a surface bends or curves. The curvature of the 
plane is 0 and the curvature of a sphere is 1/r where r is its radius. So the 
greater ris, the closer the curvature is to 0 and the more closely the geometry 
appears to be Euclidean. 


Exercises 1.4 


1. Let P and Q be statement forms. Prove that ~Q => —P is logically 
equivalent to P > Q. 


2. Let P and Q be statement forms. 
(a) Prove that (P A —Q) => Q is logically equivalent to P > Q. 
(b) Explain why this logical equivalence makes sense. 


3. Let P and Q be statement forms. 
(a) Prove that (P ~ =Q) => —P) is logically equivalent to P > Q. 
(b) Explain why this logical equivalence makes sense. 


4. Write the contrapositive and converse of the following statements: 
(a) If the power goes off, the food will spoil. 
(b) If the light is on, then the door is not locked. 
(c) If it is 9 a.M., then we started the test. 


5. Let x be a real number. Write the contrapositive and converse of the 
statement “If x? — x — 2 = 0, then x = —1 or x = 2.” 


6. Let n and m be integers. 
(a) Prove the statement “If and m are even, then n + m is even.” 
(b) State the contrapositive. 
(c) Prove or disprove the converse. 


7. Let f be a real-valued function. Write the contrapositive and converse 
of the statement “If fis differentiable at 0, then fis continuous at 0.” 


8. Professor Know-It-All tells Jonathan: “If you get at least a B on the 
final exam, then you will pass the course.” Jonathan passes the course. 
What can he conclude? 

(a) He got at least a B on the final. 
(b) He cannot conclude anything. 
Give reasons for your answer. 


9. The professor tells Karen: “It is necessary that you get at least a B 
on the final in order to pass the course.” Karen gets a B. What can 
she conclude? 

(a) She passed the course. 


10. 


11. 


12) 


13. 


14. 


15. 


16. 


17. 
18. 
19. 


20. 
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(b) She can’t conclude anything. 
Give reasons for your answer. 


The professor tells Michael: “If you get at least a C on the final exam, 
then you will pass the course.”’ Michael finds out that he got a D on the 
final. What can he conclude? 

(a) He’d better start looking for a summer school course. 

(b) There’s still hope. 

Give reasons for your answer. 


Let P and Q be statement forms. 
(a) Write out the truth table for P © Q. 
(b) Prove that =P = —( is logically equivalent to P = Q. 


Let P(n, m) be the open sentence ‘“‘n and m are odd integers” and Q(n, 
m) be the open sentence ‘“‘nm is an odd integer.’’ Determine the truth 
or falsity of the statements that follow. Justify your answers by writing 
out complete proofs. If you feel a statement is false, state its negation 
and then prove it. Explain any logical principles you use in your proofs. 
(a) For all integers n and m, P(n, m) > Q(n, m). 
(b) For all integers n and m, Q(n, m) => P(n, m). 
(c) For all integers n and m, P(n, m) © Q(n, m). 


Explain how the statement ‘‘For all integers n and m, n or m is even © 
nm is even”’ follows logically from Exercise 12. 


Repeat Exercise 12 for the open sentences P(n, m): ‘“‘n or m is an odd 
integer” and Q(n, m): “‘n + m is an odd integer.” 


Let P(n) be the open sentence ‘‘n is even”’ and Q(n) be the open sentence 
“The positive square root of 1 is an even integer.”’ Prove or disprove 
the following statements: 

(a) For all integers n, P(n) > Q(n). 

(b) For all integers n, O(n) > P(n). 

(c) For all integers n, P(n) @ Q(n). 


Let x and y be real numbers. Let S be the statement “If xy = 0, then 
x =Oory =0.” 

(a) Write the contrapositive of statement S. 

(b) Write the converse of statement S. Prove or disprove. 


Prove that the equation x? = 4y + 3 has no integer solutions. 
Let a, b, and c be integers such that a? + b? = c’. Prove that a or b is even. 


Explain how Theorem 1.4.7 is used in the proof in Example 9. More 
specifically, what is S and what is C? 


Let n be an integer. 
(a) Prove that n is even if and only if n° is even. 
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(b) Prove that n is odd if and only if n> is odd. 
21. Prove that V2 is irrational. 


22. Let P and Q be statement forms. Prove that the statement forms P © 
Q and (P a Q) v (AP « =Q) are logically equivalent. 


23. Write the negation of the following statement: ‘‘For every integer n, n 
is even if and only if n* is even.” 


Discussion and Discovery Exercises 


D1. 


D2. 


D3. 


D4. 


DS. 


Let P Q, and R be statements. 

(a) Suppose you are asked to prove that (P 4 Q) => R is a true state- 
ment and you prove that if R is false, then P is false. Is that a valid 
proof? Explain. 

(b) Suppose you are asked to prove that (P v Q) => R is a true state- 
ment and you prove that if R is false, then P is false or Q is false. Is 
that a valid proof? Explain. 


Let P and Q be statements. Suppose that whenever P is true, then Q 
is false and whenever Q is true, then P is false. Are P and —@ logically 
equivalent? Explain. 


Let P be the statement “If n and m are integers and n + m is even, 
then n and m are odd.” In Example 7, it is proved that P is false. 
Explain the logic behind this proof. In particular, explain why using 
the integers 4 and 2 is sufficient. In the course of your explanation, 
write the negation of statement P. Explain when it is appropriate to 
use examples in a proof and when it is not. 


Criticize the following “‘proof” that if m and m are odd integers, then 
n + mis even. 

Suppose that n or m is even; say n is even and m is odd. Then 
n = 2x and m = 2y + 1 for some integers x and y. Then n + m = 
2x + 2y + 1 = 2(x + y) + 1 is odd. Therefore, if m and m are odd, 
n + m must be even. 


Bill, Francie, Matt, and Liz were relaxing in their living room. Each 

of them was doing one of the following activities: watching a football 

game, practicing the piano, playing a video game, reading a detective 

novel. Assume that two of them could be doing the same thing. Suppose 

that: 

1. Someone is reading a detective novel. 

2. Only one person is playing the piano. 

3. Bill or Francie is watching a football game. 

4. The statement “Bill is playing a video game or Francie is reading 
a detective novel” is false. 
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D6. 


D7. 


D8. 


D9. 


5. Matt is playing a video game or practicing the piano. 

6. If Bill is reading a detective novel, then Matt is watching a foot- 
ball game. 

7. If Billis watching a football game, then Liz is playing a video game. 


Match the person with his or her activity. Explain how you arrived at 
your answer and mention any rules of logic that you use. 


Look up the definition of linear independence of vectors in R" in a 
linear algebra book. Write out the definition clearly, explaining any 
notation or terminology. Analyze the definition in terms of quantifiers 
and implications. Use symbols where appropriate. 


The following problem shows how working out examples and looking 
for patterns can lead to a general solution. 

A kindly prison warden decided to free his 25 prisoners for good 
behavior. The prisoners were locked in separate cells, numbered 1 
through 25. Each cell had a lock that opened when the key was turned 
once and locked when the key was turned again. The next turn would 
open the lock, the next would close it, and so on. One night, when the 
prisoners were sleeping, the warden quietly turned each lock once, 
opening all of the cells. Then he began to worry that he had freed too 
many prisoners so he went back and turned every other lock beginning 
with cell 2, thus locking those cells. Thinking that there might be still 
too many prisoners freed he gave every third lock a turn beginning 
with cell 3. Then he turned every fourth lock, then every fifth lock, 
then every sixth lock, and so on all the way to the 25th lock. (Of course 
he only turned one lock for every 13th lock, one lock for every 14th 
lock, and so on.) 

(a) Which cells ended up unlocked? Explain how you arrived at 
your answer. 

(b) Do the numbers suggest any patterns? 

(c) If the prison had 50 cells, which cells would be unlocked? 

(d) State a general result if there are n cells. State a theorem about 
integers that you would need in order to prove this general result 
and then prove it. 


Let T be aright triangle of sides a, b, and c. The Pythagorean Theorem 

states that if T is a right triangle and c is the hypotenuse, then a’ + 

b’ = c’. There is a generalization of the Pythagorean Theorem called 

the Law of Cosines. 

(a) State and prove the Law of Cosines. 

(b) State the converse of the Pythagorean Theorem. 

(c) If this converse is true, give a proof. If it is false, give a counterex- 
ample. 


In Euclidean geometry, the sum of the angles of any triangle is equal 
to 180 degrees and as we noted in Section 1.3, in Lobachevskian geome- 
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D10. 


try, the sum of the angles of any triangle is /ess than 180 degrees. Using 
the sphere model, conjecture a statement about the sum of the angles 
of a triangle in Riemannian geometry. Can you give an example of a 
triangle on the sphere in which the sum of the angles is not 180 degrees? 


Consider the following statement: “‘Euclid was the first non-Euclidean 
geometer.”’ In light of the discussions in this chapter, does this state- 
ment make sense? Explain. 


2.1 


EXAMPLE | 


EXAMPLE 2 


SEIS 


SETS AND SUBSETS 


The Notion of a Set 


We now begin the study of a fundamental idea, that of a set or collection 
of objects. This apparently simple notion is important in every field of mathe- 
matics. But we will not give a formal definition of set. Instead, the word will 
be a basic, undefined term, part of our intuitively understood vocabulary. (Try 
to define “‘set”’ yourself. You will wind up using words such as ‘‘collection”’ or 
“group,” which are no more defined than “‘set”’ is.) We must start somewhere, 
or else every definition we give will lead to another, and another—an unend- 
ing process. We choose “‘set’”’ as a beginning. 

The objects in a set are called elements. We use the notation a € A to 
express that the object a is an element of the set A, and often simply say 
‘“‘ais in A.” You should realize that this idea of membership in a set is also 
undefined, part of the concept of set. 

There are several different ways to denote a set, to convey which objects 
are elements and which are not. 


The elements may simply be listed. For example, {1, 2, 3, 4, 5} is the set 
consisting of the first five positive integers. Putting the elements of a set in 
brackets { } is standard notation. 


If there are too many elements to list (infinitely many, perhaps!), we may 
just indicate a pattern for the elements. For example, {1, 2, 3, ... 10} would 
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denote the set of all positive integers from 1 to 10 and {1, 2, 3, ...} would 
denote the set of all positive integers. 


Special notation is sometimes used for very common sets. We use R 
for the set of real numbers, and Z (from the German ‘“‘Zahlen,”’ meaning 
‘“nnumbers’’) for the set of all integers, which might also be written {... —3, 
—2, -1, 0, 1, 2, 3, ...}. The set consisting of just the positive integers is 
denoted Z*. The set of rational numbers, those of the form m/n, where m 
and n are integers with n ¥ 0, is denoted Q (for ‘“‘quotient’’). 


Let A be the set of all integers that have 0 as one of their decimal digits. 
Now of course A has infinitely many elements, so we cannot list them, and 
determining an obvious pattern to indicate the elements may not be easy. 
Instead, we take our cue from the way we have described A: as the set of 
integers that satisfy a certain property or statement. We write 


A = {n € Z| one of the decimal digits of n is 0}. 
The symbol | is read “‘such that.” 


When a set is written in the form used in Example 3, it should consist 
of an expression before the | symbol and an expression after the | symbol. 
The expression before the | symbol should introduce the variable(s) used to 
represent the elements of the set and the expression after the | symbol should 
be an open sentence, say P(x) if there is one variable x, which definitively 
describes the elements of the set. For example, {x € Z | x is even} is the set 
of even integers and here P(x) is the open sentence ‘“‘x is even.” 

In many discussions, we will be dealing only with elements of a certain, 
fixed set, say U. Such a set U provides a context, or a ‘“‘universal set” for 
the discussion. For the set A of the previous example, the universal set could 
be taken to be the set of integers. 


{1, 2, 3, 4, 5} may also be written {n € Z| 1 <n <5}. Here U = Z. 


{x € R| -1 = x <= 1} is the set of all real numbers between —1 and 1. In 
this case, U = R. 


In general, if S is a given set, A = {x © S| P(x)} denotes the set of all 
elements x in S such that the open sentence P(x) is a true statement. We 
say that A is the truth set of the open sentence P(x). 

In many cases, the set § may be clear from the context, and we would 
just write A = {x | P(x)}. In any case, the | symbol should always be followed 
by an open sentence that describes the elements of the set in some meaning- 
ful fashion. 
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Suppose A is the set of all integer multiples of 4. One possible description 
of A is 


{x € Z| x = 4t for some integer ¢}. Here S = Z. 
Another possible description of A is 
{4t|t © Z}. 


The sentence t € Z after the | symbol indicates that ¢ takes on all integer 
values, so that as ¢ runs through the integers, 4¢ runs through the multiples 
of 4. 

An incorrect way to write A in this example would be 


{t€ Z| 44. 


First of all, the expression 4¢ after the | symbol is not an open sentence. 
Moreover, writing the expression t € Z before the | symbol is wrong because 
it is not the element ¢ that is in A but rather 4¢. 


Example 6 is one example of an important type of set of integers; namely, 
a set consisting of all the multiples of a given integer. If n © Z, we will 
denote by nZ the set of all multiples of n. So 2Z is the set of even integers, 
3Z the set of multiples of 3, and so on. In set notation, 


nZ = {x € Z| x = nt for some integer ¢} = {nt | t © Z}. 


Let S be the set of integer multiples of 7. We can describe S by listing 
elements; that is, S = {0, +7, +27, +37, ...}. Note, however, that, except 
for 0, the elements of S are not integers but real numbers, so the universal 
set is R. It would be incorrect to write S = {n € Z| nz} but rather we should 
write S = {x € R| x = nz for some integer n} or simply S = {nz | n € Z}. 


The property to be satisfied by the elements may be compound; consider 
{x © R | x > 0 and x? € Z*}. This is the set of positive square roots of 
positive integers: {1, V2, V3,2, V5, .. }. 


Many of our examples of sets come from the real numbers. In one- 
variable calculus, for instance, the functions one looks at are defined on sets 
of real numbers. One particular class of such sets are intervals. For example, 
the set in Example 5 is an interval. The different types of intervals and the 
standard notations used to represent them are the following: if a and b are 
real numbers with a < b, then 
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. [a, b] = {x © R| a = x S 5}, called the closed interval from a to b. 
(a, b) = {x € R| a <x < }}, called the open interval from a to b. 
. (a,b) =f ERl|a<x< bd}. 

. [a b) =f ER|asx< bd}. 

[a,~) ={x © R|x = ah. 

(a, ©) = {x ER |x > ah. 

(-~, a] = {x ER|x < ah. 

(—%, a) = {x ER| x < ah. 

(~%,%) =R., 


CHONDA WNHE 


The first four types of intervals are called bounded intervals and the last 
five are called unbounded intervals. Types 3 and 4 are called half-open, half- 
closed intervals. Note that the symbol © is not a real number but is only 
used as a convenient notation for unbounded intervals. 

Some of the sets we’ve seen are infinite, that is, contain infinitely many 
elements. The others are said to be finite, and we denote the number of 
elements in such a set, called the cardinality of A, by |A|. For example, 
\{1, 5, 7, 8}] = 4. A more detailed discussion of infinite sets and cardinality 
is given in Chapter 6. 


Subsets 


Usually the elements of a set belong to another, bigger set. For instance, all 
of the elements of the set of integers Z belong to the set R of real numbers. 
(Can you give an example of a set that contains all the real numbers and is 
bigger than the set of real numbers?) 


Definition 2.1.1 Let A and B be sets. We say that A is a subset of B, 
and write A C B,if every element of A is also an element of B. 

If A is a subset of B and A # B, we write A C B and say that A is 
a proper subset of B. 


Note: If A is a subset of a set B, we will usually write A C B even if A # B unless 
we wish to make particular note of the fact that A is a proper subset of B. 


{1, 2, 3} € {0, 1, 2, 3, 4}. 


{x € R| x > 0} C {x © R | x = O}. This is just the statement that posi- 
tive real numbers are nonnegative. In interval notation, we could write 
(0, ©) C [0, 2%) or (0, %) C [0, ~). 


Z*CZ,ZCQ,andQCR 


Note: The expression A C B is a statement “‘A is a subset of B.”’ It has a 
subject, a verb, and an object. Itis also a statement with an implied universal 
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quantifier. It may be expressed in the form “‘Every element of A is an element 
of B”’ or “Wx € U, where U is the universal set, if x € A, then x € B.”’ We 
can write it as “Wx € U, P(x) = Q(x)” where P(x) is the open sentence 
“x € A” and Q(x) the open sentence ‘“‘x € B.” 

Therefore, to prove the statement A C B, it is usually necessary to take 
an arbitrary element of A and show that it is an element of B. This means 
letting the variable x be assigned a value, say x = a, and then proving the 
implication ‘a € A” => “a € B.” So we start the proof by saying: let a € 
A, and then we prove that a € B. By using a variable, in this case a, which 
signifies an arbitrary but unspecified element of A, we will have proved that 
every element of A is an element of B. 


Prove that A C B where A = {n € Z| nis a multiple of 4} and B = {n € 
Z | n is even}. Here the universal set is Z. The statement A C B is the 
statement ‘‘For every integer n, if n is a multiple of 4, then n is even.” 

To prove this, we must take an arbitrary element of A and show that it 
is in B. So letn € A. Then n = 4t for some t¢ in Z. It follows that n = 2(22) 
and hence n is even. Therefore n € B. We have now established that for 
every n € Z, if n € A, then n € B. We can conclude that A C B. 


In many proofs we’ll encounter in this text, it will be necessary to prove 
that a set A is a subset of a set B. 


Consider the statement ‘‘If x is a real number and x > 10, then x > 5.” This 
statement can be worded “‘For all x € R, P(x) => Q(x)” where P(x) is the 
open sentence ‘“‘x > 10” and Q(x) is the open sentence ‘“‘x > 5.” 

If we let A = {x € R| x > 10} and B = {x E R| x > 5}, then the statement 
“For all x € R, P(x) > Q(x)” is equivalent to the statement “A C B.” 

Next, we might ask: how do we prove the statement A C B in this 
example? First, the proof depends on knowing certain facts about the real 
numbers; namely, that 10 > 5 and that inequalities satisfy a so-called transi- 
tive property: if a > b and b > c, then a > c. These properties of the real 
numbers are discussed in Chapter 7. For purposes of this proof, we will as- 
sume that these properties have already been established. 

Now let x € A. We have x > 10 and since 10 > 5, by the transitive 
property it follows that x > 5. Hence x € B. We have now proven that 
ACB. 


The following proposition gives a result that holds for all sets and is a 
further illustration of showing one set as a subset of another. 


Let A, B, and C be sets, and suppose A C Band BC C. Then A CC. 


PROOF: To prove that A C C, it is necessary to show that any element of 
A is an element of C. So we let a € A. Because A C B, we havea € B. 
But then since B C C,a € C. We can now conclude that A C C. '% 
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Sometimes two sets may be the same even though they are described 
differently. The following example is an illustration. 


Let A = {n € Z| nis even} and B = {n € Z | n’ is even}. It was proven in 
Theorem 1.4.4 that n is even if and only if n’ is even. Therefore, any element 
of A must be in B and any element of B must be in A. Therefore the two 
sets are really the same. 


Definition 2.1.3 We say two sets A and B are equal, written A = B, if 
they have the same elements. 


Note: Two sets A and B are equal if and only if every element of A is an ele- 
ment of B and every element of B is an element of A. In other words, A = B & 
(A C Ba BCA). So proving that two sets are equal often requires two proofs. 


Sometimes the statement A C B is false. Let’s analyze how to prove such 
a statement is false. 

Recall that the statement A C B contains a universal quantifier. It means 
that for all a, if a © A, then a € B. As previously noted, A C B is the 
statement “Wx, P(x) = Q(x)” where P(x) is the open sentence ‘x € A” and 
Q(x) is the open sentence ‘“‘x € B.” So the negation of A C B must be the 
statement ‘‘Jd a value of the variable x, say x = a, such that the statement 
P(a) > Q(a) is false.”” Now the only way an implication P(a) > Q(a) can 
be false is if P(a) is true and Q(a) is false; that is, if a € A but a € B. (As 
you might expect, € means “‘is not an element of.”’) 

To summarize: A C B is false if Ja € A such that a € B. 


The set A = {1, 2, 3,4, 5}is not a subset of B = {1, 2, 3,4, 6}, since 5 € A 
but 5 € B. 


Consider the statement 
All triangles in the Euclidean plane are isosceles. 


This statement is equivalent to the statement A C B where A is the set 
of all triangles in the Euclidean plane and B is the set of all isosceles triangles 
in the Euclidean plane. 

The statement A C B is false. To prove it is false, we need merely find 
an element of A that is not in B. Since the right triangle with sides of length 
3, 4, and 5 is such an element, the proof is complete. 


Let our universal set U be the set of all real-valued functions of one variable. 
Let [a, b] be a closed interval on the real line; that is, [a,b] = {x ER|as 
x = b}. Let A = {f| fis continuous on [a, b]} and B = {f| fis integrable on 
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[a, b]}. It is an important theorem of calculus that every continuous function 
on [a, b] is integrable on [a, b]. In other words, A C B. The converse of this 
statement is B C A. This converse, however, is false since there are integrable 
functions that are not continuous. The function f defined by f(x) = Oif x = 
a and f(x) = 1 if a < x = b is one example of an element of B that is not 
in A. 


Complements 


As the previous examples suggest, we often want to determine if a given 
element is not in some set. We introduce the following definition. 


Definition 2.1.4 Let A, B be sets. The complement of A in B, denoted 
B-—A,is{b € B|b ¢ A}. 


{1, 2,3, 4,5} — {1, 3} = {2, 4, 5}. 


{1, 2, 3, 4, 5} — {1, 7, 8} = {2, 3, 4, 5}. Notice that there is no need for either 
set to be a subset of the other. 


For convenience, if U is a universal set, we will write U — A = A, called 
simply the complement of A. 

For example, if our universal set is Z, so that we are only considering 
integers, then Z* = {... -—3, -2, -1, 0}, the nonpositive integers. 

Sometimes we encounter the set that contains no elements. For example, 
if U = Rand A = {x | x? < 0}, then A has no elements since for every real 
number x, x’ = 0. We call a set with no elements the empty set, and we 
denote it ©. = = 

Note that if U is a universal set, then U = Wand = U. 


The following result shows how the statement A C B is affected by taking 
complements. Note how the proof just involves applying a rule of logic that 
we discussed in Chapter 1. 


Let A and B be sets, contained in some universal set U. Then A C B if and 
only if BCA. 


PROOF: Let x € U. Let P(x) be the open sentence ‘“‘x € A” and Q(x) be 
the open sentence ‘‘x € B.” Then —P(x) is the open sentence “x € A” or 
equivalently “x € A” and —Q(x) is the open sentence “‘x ¢ B” or equivalently 
ba E B.” 

A C Bis the statement “Wx, P(x) = Q(x)” and B CA is the statement 
“Wx, =Q(x) = AP(x).” Since for any value a of x, P(a) > Q(a) is logically 
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equivalent to its contrapositive —Q(a) = —P(a), it follows that A C B if and 
only if BCA. © 


Let A = {f| fis continuous on [a, b]} and B = {f | fis integrable on [a, b]} 
as in Example 18. As we saw in that example, the true statement from 
calculus ‘‘Every continuous function on [a, b] is integrable on [a, b]’’ is 
equivalent to the statement A C B. The statement B CA is the statement 
“Tf fis not integrable on [a, b], then f is not continuous on [a, b].’’ This latter 
statement is also true and requires no proof since it is logically equivalent to 
the statement A C B. 


MATHEMATICAL PERSPECTIVE: INTERVALS 


Earlier in this section we discussed the different types of intervals on the 
real line. Intuitively, an interval is a set of real numbers without “‘holes” in 
it. This would mean that if x and y are two numbers in the interval then all 
numbers between x and y are in the interval. A formal definition of interval 
would be the following: 


Definition 2.1.6 A subset J of the set of real numbers R is called an 
interval if J # ©, I contains more than one element, and for every x 
and y in J such that x < y, [x,y] CL. 


It is not hard to show that any of the nine types of intervals previously 
mentioned are in fact intervals by this definition. (See Exercise 24 following.) 

The difference between an open interval (a, b) and the closed interval 
[a, b] is more profound than the simple fact that the closed interval contains 
the endpoints and the open interval does not. Many mathematical results 
that are true for closed intervals are not true for open intervals and vice 
versa. For example, the Max-Min Theorem of calculus states that if a real 
valued function f is continuous on the closed interval [a, b], then f has a 
maximum and a minimum value on [a, b]. This theorem is false if the closed 
interval [a, b] is replaced by the open interval (a, b). The function f defined 
by f(x) = x?, for instance, is continuous on the open interval (0, 1) but has 
no maximum or minimum value on this interval. 

Another example is the theorem from calculus that says that if a real- 
valued function fis continuous on the closed interval [a, b], then fis bounded 
on [a, b]. (Recall from Example 14 of Section 1.1 that a real-valued function 
fis bounded on a closed interval [a, b] if there exists a positive real number 
M such that | f(x) | = M for all x € [a, b].) As in the Max-Min Theorem, 
this theorem is false for open intervals. For example, the function f(x) = 
1/x is continuous on the open interval (0, 1) but is unbounded on this in- 
terval. 
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Both this last theorem and the Max-Min Theorem are usually not proved 
in a beginning calculus course because the proofs depend on a property of the 
real numbers called the Least Upper Bound Axiom. This axiom is discussed in 
Chapter 7. 


Exercises 2.1 


1. Describe the following finite sets by listing their elements. 

(a) The set of all positive integers between 15 and 21 inclusive 

(b) The set consisting of the 8 smallest squares of positive integers 

(c) The set of all integers of absolute value less than 1 

(d) The set consisting of the areas of circles of radius 1, 2, and 3 

(e) {x € Z| x? < 13} 

(f) The set of all U.S. presidents who were born in the twentieth century 

(g) The set of the first ten prime numbers that are the sums of two 
squares. (A positive integer p is a prime number if p > 1 and the only 
Dositive integers that are factors ofp are 1 and p.) 

(h) The set of the first 20 positive integers that are the sums of two odd 
prime numbers 

(i) The set of all remainders when a positive integer is divided by 7 


2. Describe the following sets by listing enough elements to indicate a 

pattern for all of the elements of the set. 

(a) {x € Z| x > 10} 

(b) {x € Z| x? > 4} 

(c) The set of all integer multiples of 4 

(d) The set of all positive integers that have a remainder of 3 when 
divided by 4 

(e) The set of all remainders when a positive integer is divided by 93 

(f) The set of all integers that are sums of consecutive odd integers 
starting at 1 

(g) The set of all numbers x for which sin x = 0 

(h) The set of all numbers x for which cos x = 0 


3. (a) Based on your answer to 1(g), make a conjecture about all prime 
numbers that are the sums of two squares. 
(b) Based on your answer to 1(h), make a conjecture about all positive 
integers that are the sums of two odd prime numbers. 
(c) Based on your answer to 1(i) and 2(e), make a conjecture about the 
set of all remainders when a positive integer is divided by a positive 
integer n. 


4. Describe the following sets by writing them in the form {x € S|...} for 
the appropriate set S. 
(a) {0, +1, +2, +3, +4, +5, +6} 
(b) The closed interval [1, 2] 
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(c) The set of all rational numbers between 0 and 1 inclusive 

(d) The set of all integer multiples of 4 

(e) (3, -1,7, -5, 11, -9,...} 

(f) The set of all positive integers that have a remainder of 3 when 
divided by 4 

(g) The set of all integer multiples of 7/2 

(h) The set of all real numbers x for which cos x = 0 

(i) The set of all real numbers whose distance from the origin is greater 
than 5 


. Let A be the set of all integer multiples of 5: A = 5Z. 


(a) Express A in the form {x € S| ...} for the appropriate set S. 
(b) Prove that if x,y € A, thenx + y€ A andxy EA. 


. Letn € Zand let A = nZ. Prove that if x, y € A, then x + y © A and 


xy EA. 


. Let P(x) be the open sentence ‘‘x < 5’’ and Q(x) be the open sentence 


“x = —1.” Express the following sets in interval notation if possible. If 
the set cannot be expressed in interval notation, give a reason. 

(a) {x © R| P(x) » O(x)}. 

(b) {« ER | P(x) v O(x)}. 

(c) & ER | 4P(x)}. 

(d) {x © R | AO(x)}. 

(e) {x E R | P(x) A 4Q(x)}. 

(f) {x € R | P(x) a O(a)}- 

(g) {x © R | P(x) - O(x)}. 

(h) & © R | P(x) v O(a}. 


. For the following open sentences P(x), express the set {x © R | P(x)} in 


interval notation if possible. If the set cannot be expressed in interval 
notation, give a reason. 

(a) |x| <4 (b) |x| <7 

(c) |x| >2 (d) |x-1|<3 

(ey (ea 7 | 9 


. Let A be the set of all integer multiples of 9 and B be the set of all 


integer multiples of 3. 
(a) ee the statement A C B as a statement of the form ‘“‘For all 
. then...” but without using the symbols A and B. 

(b) ae A and Bi in the form {x € S| .. .} for the appropriate set S. 

(c) Prove A C B. 

(d) Write the negation of the statement in part (a). 

(e) Write the contrapositive of the implication in part (a). Is it true or 
false? Give a reason for your answer. 

(f) Write the converse of the implication in part (a). Then prove it or 
disprove it. 


2. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 
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Let n and m be integers. Let A = nZ and B = mZ. 
(a) Prove that if ” is a multiple of m, then A C B. 
(b) Write the converse of the statement in part (a). Prove or disprove. 


Let U be the set of all real-valued functions of one variable. Let [a, b] 
be a closed interval on the real line. Repeat Exercise 9 for the sets 
A and B where A is the set of all functions that are differentiable on 
[a, b] and B is the set of all functions that are continuous on [a, b]. You 
may use any relevant theorem from calculus to do this problem. 


Let A = {n € Z| nis a multiple of 4} and B = {n € Z| n’ is a multiple 
of 4}. 

(a) Prove that A C B. 

(b) Is B C A? Prove or disprove. 


Let A = {n € Z|n + 3 is odd}. Prove that A is equal to the set of all 
even integers. 


Let A = {n € Z|n = 8 + 7 for some ¢t € Z} and B = {n€ Z|n= 
4¢ + 3 for some t € Z}. 

(a) List 5 elements of A and 5 elements of B. 

(b) Is A C B? Prove or disprove. 

(c) Is B € A? Prove or disprove. 


Let A = {n€ Z|n = 4¢ + 1 forsomet € Z} andB={nEZ|n= 
4¢ + 9 for some t € Z}. Prove that A = B. 


Let A = {n € Z|n = 3t + 1 for some rt € Z} and B = {n€ Z|n = 
3t + 2 for some t € Z}. Prove that A and B have no elements in common. 


Find sets A and B so that the following statements can be expressed in 
the form A C B. Also write the universal set U. 

(a) Every hexagon has 6 sides. 

(b) Every polynomial is differentiable on R. 

(c) All prime numbers greater than 2 are odd. 

(d) Every integer that is a multiple of 6 is even. 


Without using set notation, write the statement BCA, where A and B 
are the sets defined in 

(a) Exercise 9 

(b) Exercise 11 

(c) Exercise 17(a) 

(d) Exercise 17(b) 

(e) Exercise 17(c) 

(f) Exercise 17(d) 


Compute the complements of the following sets. 
(a) A={xEZ|x>5},U=Z. 
(b)A={xEZl|1lsxs10,U=Z. 
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20. 


21. 


22. 


23, 
24. 
25, 


(c) A={rEZ|1sx<10},U=R. 
(d) A={xER|-2<x<4,U=R. 
(e) A={xE R|x<2orx > 20},U=R. 


For the following sets A and B, compute A — Band B — A. 
(a) A = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}, B = {6, 7, 9, 11, 12}. 
(b) A = {1, 2, 3,..., 100}, B = {1, 3, 5,..., 99}. 

(c) A = {0,1, 2, 3,...}, B = {0, 1, 2, 3}. 

(d) A = {0, 1, 2, 3,...}, B = {2, 4,6, ...}. 

(e) A= {x ER|x > 4}, B = {x ER| x < 10}. 

(f) A={xER|x > 4}, B= {x ER|x > 10}. 


Let A and B be sets contained in a universal set U. Which one(s) of the 
following statements are equivalent to the statement A C B? Give rea- 
sons for your answers. 

(a) Vx © U,x € Aandx EB. 

(b) Vx € U, if x ¢ B, then x ¢ A. 

(c) ax CEUBxEAandx EB. 

(d) Vx € U, if x € A, then x € B. 

(e) Vx € U, if x € A, then x ¢ B. 


It has been noted in this section that two sets A and B are equal if and 
only if A C B and B C A. Using this characterization of equal sets, 
complete the following statement “Two sets A and B are not equal if 
and only if....” 


Use Exercise 11 of Section 1.3 to prove Proposition 2.1.2. 
Prove that the open interval (a, b) isin fact an interval by Definition 2.1.6. 


(a) Give an example to show that the complement of an interval need 
not be an interval. 
(b) Give an example of an interval whose complement is also an interval. 


Discussion and Discovery Exercises 


D1. 


D2. 


Let A and B be sets. Criticize the following ‘“‘proof” that A C B or 
BCA. 


Let P be the statement ‘“‘x € A” and Q be the statement ‘“‘x € B.” 
Then A C Biss the statement P => Q and B C A's the statement O > 
P. Exercise 9 of Section 1.3 asserts that the statement (P > Q) v 
(Q = P) is always true. Hence the statement A C B or B C A is 
always true. 


Let A = {x € R| x < l} and B = {x ER | x < 0}. Criticize the following 
“proof” that B C A. 


xEB x<0<1 xEA 
Therefore, B C A. 


2.2 


EXAMPLE | 


EXAMPLE 2 


EXAMPLE 3 


EXAMPLE 4 
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D3. Does the statement ‘‘A is not subset of B”’ mean that if x € A, then 
x € B? Give reasons for your answer. 


D4. Show how you might ‘‘discover” the truth of Theorem 2.1.5 before 
actually giving a formal proof of it. 


DS. Explain why the function f(x) = x? does not have a maximum or a 
minimum value on the open interval (0, 1). 


D6. Give an example of a function that does not have a maximum or a 
minimum value on the closed interval [0, 1]. Why does your example 
not contradict the Max-Min Theorem? 


COMBINING SETS 


Unions and Intersections 


It is possible to create new sets out of two or more given sets. The two most 
common ways of doing this are given in the following definition. 


Definition 2.2.1 Let A and B be sets. The union of A and B, denoted 
AUB, is{x|x EA vx © B}. 
The intersection of A and B, denoted AM B, is {x|x € A Ax € B}. 


For an element x to be in the union of two sets, x must be in one or the 
other of the two sets, although it could be in both. To be in the intersection 
of two sets, x must be in both sets. 


{1, 2, 3, 4, 5} U {6, 7} = {1, 2, 3, 4, 5, 6, 7}. 


{1, 2, 3} U {2, 3, 4} = {1, 2, 3, 4} and {1, 2, 3} M {2, 3, 4} = {2, 3}. Notice that 
we cannot compute |A U B| by simply adding |A| and |Bl. 


(0, 3] U [2, 5] = [0, 5] and (0, 3] 1 [2, 5] = [2, 3] but note that [0, 3] U 
[4, 5] is not an interval. 


We encounter something of a puzzle if we try to compute, for example, 
{1, 2} N {3, 4} or [0, 3] N [4, 5]. There are no elements that are in both of 
these sets; the intersection, therefore, must contain no elements! So the 
intersection is the empty set. 

We say two sets are disjoint if their intersection is the empty set. 
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In order to visualize the union and intersection of sets, we often use the 
interiors of circles to represent a set. This type of representation is called a 
Venn diagram. 

Next we see the Venn diagrams for the union and intersection of two sets. 


AUB ANB 


It is clear from this picture that the order in which we write the union 
and intersection does not matter. In other words, for any sets A and B, 
AUB=BUAandANB=BNA. Of course, a picture is not a proof 
because a set is not necessarily the interior of a circle. But the picture helps 
us to understand why these properties are true. 

If we have three sets A, B, and C, the following diagrams represent their 
union and intersection. 


as OO 


AUBUC ANBOC 


It is clear from these pictures that (A U B) UC = A U (B U C) and 
(AN B)NC=AN(BNC). 

We list these properties of sets as well as some additional ones in the 
following proposition. 


Proposition 2.2.2 The following statements are true for all sets A, B, and C. 


1AUB=BUA. 
2 ANB=BQNA. 


EXAMPLE 5 


EXAMPLE 6 
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(AUB)UCH=AU(BUC). 
S(ANB)NC=HAAN(BNC). 

-ACAUB. 

-ANBCA. 

. © C A; that is, the empty set is a subset of every set. 
-AUD=A., 

-AND=L. 


PROOF: Properties 1 through 4 are clear from the appropriate Venn diagrams 
above. Similarly, Venn diagrams can be drawn to illustrate Properties 5 and 
6. (See Exercise 10.) 

For 7, a diagram is no help because we cannot draw the empty set. 
(There’s nothing there!) 

To prove that © C A, it is necessary to prove the statement Vx € U, 
P(x) = Q(x), where P(x) is the open sentence ‘“‘x € @” and Q(x) is the 
open sentence “‘x € A.” But for any choice of the variable x, the hypothesis 
P(x) is false since @ contains no elements. Therefore, the implication 
P(x) = Q(x) is always true. (Why?) Hence @ C A for every set A. 

We leave the proofs of 8 and 9 to the exercises. #= 


OMmANDADANA WW 


Note that Properties 3 and 4 allow use to write AU BU Cand AN 
BN C without ambiguity. 

Note also that we did not give formal proofs of Properties 1 through 6 
of Proposition 2.2.2, since the Venn diagrams made the properties seem 
obvious. That will not always be the case, so it is necessary to know how to 
write out formal proofs of properties of sets. Usually such proofs require 
proving either that two sets are equal or that one set is a subset of another. 
Recall that proving two sets are equal means giving two proofs; namely, that 
each of the two sets is a subset of the other. Here are two examples. You 
will see more in the exercises. 


Let A and B be subsets of a universal set U. We will prove that A — B = 
A 1 B. To prove this statement, we need to prove that A — B is a subset 
of AN Band that A M B isa subset of A — B. 

First, we prove that A — B is a subset of AM B. To do this, we must 
demonstrate that any element of A — B is an element of A M B. Let x € 
A — B. Then x € A and x ¢ B. Sox € A and x € B, which implies that 
x € ANB. Therefore, A — BCANB. 

Now, we show that AM B is a subset of A — B. Let x € AN B. Then 
x € A and x € B, implying that x € A and x € B. Thus x € A — B. This 
proves that AN BCA — B. 

We can now conclude that A N B =A — B. 


Let A and B be subsets of a universal set U. We will prove that A CB @ 
AUB=B. 
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This example is different from the previous one in the sense that it is 
necessary to prove two sets equal but within a biconditional; that is, under 
the assumption that A C B, we must prove that the sets A U B and B are 
equal. Then assuming that these two sets are equal, we need to prove that 
A is a subset of B. The proof goes as follows. 


PROOF: Suppose that A C B. We will prove that A U B isa subset of B and 
Bis asubset of AU B. Letx € AUB. ThenxE€ AorxE€ BIX EA, 
then x € B because A C B. So m either case, x € B. This proves that A U 
B C B. The fact that B C A U B is a consequence of Proposition 2.2.2. 
Hence AUB=B. 

Now suppose that A U B = B. We want to prove that A C B. Letx © 
A. Then x € A U B by Proposition 2.2.2. But we are assuming that A U 
B=8B,and sox € B. It follows that A C B. &* 


The following two properties of union and intersection are a bit more 
complicated to prove than the properties given in Proposition 2.2.2. We draw 
the Venn diagrams first to illustrate why the theorem is true and then get 
to the formal proofs. 


Let A, B, and C be sets. Then 


1. 
2. 


N (BUC) =(ANB)U(ANC). 
U(BNC)=(AUB)N (AUC). 


> b 


Way, AY, 
y y 


AN(BUC)=(ANB)UANC) AU(BOC)=(AUB)N(AUC) 


PROOF: We give two proofs of 1 and leave the proof of 2 as an exercise. 

(First Proof) Let x € A M(B UC). Then x € A and x € BUC, so 
xEBorxEC. 

If x € B, then since we know that x € A, we have x CAMB. ButAN 
BC(AN B)U(AN OC) by Property 5 of Proposition 2.2.2, sox € (A NM 
B)U(AN C). 

Similarly, if x © C, we have that x € A M C and thus that x € (A MN B) 
U(ANC). We have therefore shown that A 1(BUC) C(ANB)U(ANC). 


EXAMPLE 7 


Theorem 2.2.4 
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To establish the reverse, letx € (AM B)U(A NC). Thenx EG ANB 
orx EC ANC. 

In the former case, we have x € A and x € B, but since BC BUC 
(why?),xE& BUC,sox EAN (BUC). 

A similar argument works in the latter case, so in either case we have 
xEAN(BUC). 

Therefore, (AM B)U(ANC)CAN(BUC). 

We have now proven that A 1 (BUC) =(AN B)U(ANC). 

(Second Proof) This proof relies on Example 15 of Section 1.2. 

Let x € U. Let P, Q, and R be the following statements: P: ‘“x € A,” Q: 
“x © B,’ and R: “x € C.” 

Then P ~ (Q v R) is the statement ‘‘x € A and either x € Bor x € C,” 
which is equivalent to the statement “‘x € AM (BU C).” Similarly, P ~ Q 
is the statement ‘x € A B,’ P ~ R is the statement “x € AMC,” and 
(P a Q) v (Pa R) is the statement “x € (AN B) U (AN C).” 

Since P a (Q v R) is logically equivalent to (P a Q) v (Pa R) by Example 
15 of Section 1.2, it follows that x € AM (B U C) if and only ifx E (AN 
B) U(AN C). ThusAN (BUC)=(AN B)U(ANC). 


DeMorgan’s Laws 


The following example shows what happens when we take complements of 
unions and intersections. 


Let U = { 1, 2, 3, 4, 5, 6, 7, 8, 9, 10} be our universal set. Let A = {2, 3, 4, 
5, 6} and B = {1, 4, 7, 10}. 

Then A = {1, 7, 8, 9, 10} and B = {2, 3, 5, 6, 8, 9}. 

Now A U B = (1, 2, 3, 4, 5, 6, 7, 10} and so A U B = {8, 9}. 

Note that A U B is just the intersection of A and B. Similarly, since AN 
B = {4}, we see that AN B=AUB. 

Now we prove that this example can be generalized to all sets. The results 
are known as DeMorgan’s Laws. 


Let A and B be sets contained in some universal set U. Then 


1AUB=ANB. 
2, ANB=AUB. 


PROOF: We give two proofs of 1 and leave the proof of 2 as an exercise. 

(First Proof) We need to show two things: AU BC AN Band AN 
BCAUB. 

To show that A U B C AN B, we start by letting x € A U B. This 
means that x is not in A U B and so x cannot be in either A or B. Thus x 
is not in A and x is not in B. Hence x € A and x € B or equivalently x € 
AN B. This proves that A U BC ANB. 


66 


EXAMPLE 8 


EXAMPLE 9 


CHAPTER 2 SETS 


Conversely, let x € A M B. Then x € A and x € B or equivalently x € 
A and x €& B. So x is not in either A or B, or in other words x € A U B. 
Hence x € A U B. ThusAM BCA UB. It follows that A UB = ANB. 

(Second Proof) Let x € U. Let P be the statement ‘‘x € A” and Q be 
the statement ‘‘x € B.” 

Then P v Q is the statement ‘“‘x € A or x € B,” which means “x € 
A U B.” So -(P v Q) is the statement “x € A U B” or “x CA U B.” 

Also, 4P is the statement ‘‘x ¢ A” or equivalently “x € A” and —Q is 
the statement ‘“‘x ¢ B” or equivalently “x € B.” It follows that —P ~ 4Q 
is the statement ‘“‘x € A and x € B” or equivalently “x € AM B.” 

We saw in Section 1.2 that =(P v Q) and —=P ~ —@ are logically equivalent 
statement forms. It follows that x € A U B if and only if x € AN B. 

Therefore, A UB=ANB. & 


Cartesian Products 


We sometimes think of the union of two sets as their ‘‘sum,”’ as if we had 
“added” them, although, as we saw in Example 2, not even the numbers of 
elements in the sets are really added. Nevertheless, you might wonder if 
there can be a similar analog to multiplication. We now define a set that is 
the “product” of two sets in a very satisfactory way. 


Definition 2.2.5 Let A and B be sets. The Cartesian product of A and 
B, written A x B, is {(a, b) |a € A and b € B}. 


Informally, A x B is a set of pairs of objects. The first object is chosen 
from A and the second from B. Notice that we distinguish the order (these 
are commonly called ordered pairs). 


Let A = {1, 2} and B = {3, 4, 5}. Then A x B = {(1, 3), (1, 4), (1, 5), (2, 3), 
(2, 4), (2, 5)}. 

It’s worth making a few comments about this simple example. First, 
notice how important order is: (1, 3) € A X B but G, 1) € A X B. Even 
more basic, notice that (1, 3) is one element of A X B, which is made up of 
two objects. In fact, |A < B| = 6; there are six ordered pairs. Here by the 
way is a reason why A X B is called a product: if A and B are finite sets, 
|A < B| = |A||B]. You will be asked to prove this in Chapter 5. (See Exercise 
34 of Section 5.2.) 


There is no reason why A and B must be different sets. For example, R < 
R = {(x, y) | x, y © R}. This should be a very familiar set to you: it’s usually 
visualized as the Cartesian plane in analytic geometry and calculus. Again, 
be aware of the importance of order in these sets; you know that (1, 2) is 
not the same point in the plane as (2, 1). 


Theorem 2.2.6 
Corollary 2.2.7 


Corollary 2.2.8 


Theorem 2.2.9 
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HISTORICAL COMMENTS: IRRATIONAL NUMBERS 


As the theory behind the concepts of calculus developed in the 19th century, 
it became clear to mathematicians that a rigorous definition of the real 
number system was necessary. The difficult part of any definition of the real 
numbers was defining the concept of irrational number. (At this point, the 
properties of the integers and rational numbers were taken for granted.) 

In Section 1.4, we proved that V2 is irrational, a proof that is attributed 
to the Pythagoreans. The discovery that there could be a mathematical 
quantity that is not the ratio of two whole numbers was startling to the 
Pythagoreans. But it was not something that could be ignored, because the 
diagonal of a unit square has length V2. Such quantities were called incom- 
mensurable ratios as opposed to fractions, which were called commensurable 
ratios. Although irrationals were used freely in calculations, they were not 
fully accepted as numbers even as late as the 17th century. It was not until 
the 19th century, most notably with the work of Karl Weierstrass (1815- 
1897), Georg Cantor (1845-1918), and Richard Dedekind (1831-1916), that 
a formal definition of irrational number was given. 

Proving that a given real number is irrational can be a difficult problem 
in general. For example, it is considerably harder to prove that 7 and e are 
irrational than it is to prove that V2 is irrational and, although it is conjec- 
tured that 7 + e is irrational, no proof has yet been found. The irrationality 
of 7 was first proved in 1761 by J. H. Lambert (1728-1777) by means of 
continued fractions. There are many other proofs today. A general theorem 
about the trigonometric functions is the following, which we state without 
proof. We leave the corollaries as exercises. 


If r is a nonzero rational number, then cos r is irrational. 
7 is irrational. 


If r is a nonzero rational number, then sin r, tan r, sec r, csc r, and cot r are 
all irrational. 


The proof that e is irrational is easier than the one for 7 so we give the 
proof next. 
e is irrational. 


PROOF: As with most of these irrationality proofs, we assume that e is rational 
and derive a contradiction. First of all, recall that e can be represented by 
the infinite series 


This is just the MacLaurin series for e* with x = 1. 
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Suppose now that e = n/m where n and m are positive integers. Notice 
that the expression 


ck dd orl 1 
Moe fr oh Bb pl 


is an integer. (See Exercise 33 following.) 
Using the infinite series just given, we get that 


1 1 1 
i ae Pied WAG eee = ‘ 
ise (on (m+2)! (m+)! t © Zfor some ¢ 


Multiplying through by m! on the left-hand side gives 
js 1 re 1 1 ti 
mt lo (mt1\(mt+2)) (m+1)m+2)\(m +3) 
7 > 1 
EG (m+1)(m + 2)...(m +k) 


For each positive integer k, 


1 1 
(m+l(m+2)...(m+k) (nt ly 


and the geometric series > 1/(m + 1)* converges to 1/m. (See Exercise 
k=l 
34 following.) 
We now get the contradiction that ¢ is a positive integer and ¢ < 1. 
Therefore, e must be an irrational number. @ @ 


The following theorem is a generalization of Theorem 2.2.9 but more 
difficult to prove. We state it without proof. 


If r is a nonzero rational number, then e’ is irrational. 


An excellent source for problems of this type, including a proof of Theo- 
rem 2.2.10, is Irrational Numbers by Ivan Niven [16]. 


Exercises 2.2 


1. Let A = {1, 2, 3, 4, 5}, B = {0, 1, 4, 8}, and C = {2, 5, 7, 9, 11, 13, 17}. 
Compute each of the following. 


(a) AUB. (b) ANB. 
(c) ANC. (d) A -B. 
(ec) A- (BUC). (f) (A- B)U(A-C). 
(2) AN (BUC). (h) (AN B)U(ANC). 


(i) AU(BNC). (j) (AUB) NA (AUC). 
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. Let A, B, and C be the sets given in the previous exercise and let the 


universal set U = {0, LDS wes 20}. Compute the following. 
(a) A _ (b) B (c) ANB 
(d) AUB (ec) AUB (f) ANB 


. (a) Let A, B, and C be the sets given in Exercise 1. Compute |A U Bl, 


|A U C|, and |B U Cl. 
b) Let A and B be arbitrary finite sets. Based on your answers to part 
y y p 
(a), deduce a formula for |A U B| and then prove it. 


. Let U = R and let A, B, and C be the following intervals: A = [1, 4], 


B = (0, 2), and C = [2, »). Compute the following. Express your answer 
as an interval or a union of intervals whenever possible. 


(a) AUB (b) ANB (c) AUC 
(d) ANC (e) BUC (ff) BAC 
. Let A, B, and C be the sets given in Exercise 4. Compute the following. 
(a) A (b) B (c) A- B 
(d) B-A (e) A-C (f) B-C 


. Express each of the following sets as an interval or a union of intervals: 


(a) {x ER || x | > 6} 

(b) tx E R|| x — 3| > 10} 
(c) &§ ER|O<]|x|< 1} 
(d) {x €ER|O0<|x-1]< 2} 


. Let A and B be open, bounded intervals such that AN B # ©. 


(a) Prove that A U B is an open interval. 
(b) Prove that A M B is an open interval. 


. Let A and B be closed, bounded intervals such that AN B # ©. 


(a) Is A U B a closed interval? Prove your answer. 
(b) Is AM B a closed interval? Prove your answer. 


. Let A be the set of all integer multiples of 5 and B the set of all even 


integers. Compute 
(a) AUB (b) ANB 
(c) A-B (d) B-A 


. Draw Venn diagrams to illustrate Properties 5 and 6 of Proposition 2.2.2. 


Then give formal proofs of these properties. 


. Prove Properties 8 and 9 of Proposition 2.2.2. 
. Draw Venn diagrams to illustrate Examples 5 and 6 of this section. 


. Let A = {x © R|x <= 10} and B = {x © R|x > 5}. Prove that 


(a) AN B = (5, 10]. 
(b) AUB=R. 
(c) A — B= (-~, 5]. 
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14. 


15. 


16. 


17. 


18. 


19. 


20. 
21. 


22. 


23. 


24. 


25. 


26. 


Let A and B be sets. Prove the following: 
(a) AN (AUB) =A. (b) AU (AN B) =A. 


Let A, B, and C be sets. 

(a) Prove that A C(BNC)S(ACB)A(ACC). 
(b) State the contrapositive of part (a). 

(c) State the converse of part (a). Prove or disprove. 


Let A, B, and C be sets. 

(a) Prove that (A CO)A(BCC)SAUVUBCC. 
(b) State the contrapositive of part (a). 

(c) State the converse of part (a). Prove or disprove. 


Let A, B, and C be sets. 

(a) Prove or disprove: If AC BUC,then A C BorA CC. 
(b) State the contrapositive of part (a). 

(c) State the converse of part (a). Prove or disprove. 


Let A, B, and C be sets. Prove that if A is not a subset of C, then A is 
not a subset of B or B is not a subset of C. 


Give two proofs of part 2 of Theorem 2.2.3 similar to the two proofs of 
part 1 of that theorem. 


Draw Venn diagrams to illustrate the two parts of Theorem 2.2.4. 


Give two proofs of part 2 of Theorem 2.2.4 similar to the two proofs of 
part 1 of that theorem. 


Let A and B be subsets of a universal set U. Prove the following: 
(a) (AUB)NA=B-A. 
(b) (A U B) — (AN B) = (A — B)U (B—- A). 


Let A and B be subsets of a universal set U. Prove that A CB SAN 
B= A. 


Let A and B be subsets of a universal set U. 
(a) Prove thatA =A-BSOANB=2f. 
(b) Prove that AUB=USACB. 


Let A, B, and C be sets. Prove the following: 
(a) A- (BUC) =(A-B)N(A-C). 
(b) A-(BNC)=(A-B)U(A-C). 


Characterize the following expressions as either: 

(i) a statement, 

(ii) not a statement but an expression that makes sense mathematically, 
(iii) an expression that makes no sense mathematically. 

Assume that A and B are sets and P and Q are statements. 
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(a) BCA. (b) BNA. 

(c) ASB. (d) PUQ. 

(e) P= (A = B). (f) A+B. 

(g) (B-A)>@Q. (h) Vx EP x EQ. 
(i) PCQ. (j) AAB. 

(k) P=Q. 


27. Write out the elements of A X B where A = {—1, 0} and B = {7, 12, 
19, 21}. 


28. Let Q and I be the sets of rational and irrational numbers respectively. 
(a) Prove that ifr © Qandx EI, thenr+xEL 
(b) Prove that ifr € Q,r #0, andx EI, thenrx EL. 


29. (a) Is the sum of two irrational numbers always irrational? If true, give 
a proof and if false, give a counterexample. 
(b) Is the product of two irrational numbers always irrational? If true, 
give a proof and if false, give a counterexample. 


30. Prove Corollary 2.2.7. 
31. Prove Corollary 2.2.8. 
32. State the converse of Theorem 2.2.6. Prove or disprove. 


33. Assuming that e = n/m as in the proof of Theorem 2.2.9, prove that the 
expression 


is an integer. 
34. Prove that the geometric series 
pe ee 
ii (im + 1) 
converges to 1/m. 


35. Prove that if r is a positive rational number and r # 1, then In r is 
irrational. (Note: In denotes the natural logarithm function.) 


Discussion and Discovery Exercises 


D1. Let A and B be sets. Criticize the following ‘“‘proof” of Example 6; that 
ACBSeAUB=B. 


Letx € AC B.Thenx€ Aand Bsox€ AU BandAUB=B. 
Letx € AUB =B8B.Thenx € A orx € Bandx € B. Therefore A C B. 
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COLLECTIONS OF SETS 


In this section we consider sets whose elements are themselves sets. For 
example, we could consider the collection of all sets of integers that contain 
the integer 1. Such a collection, let’s call it S$, could be expressed in the 
following way: S = {A C Z| 1 € A}. S is a set whose elements are sets, 
specifically, sets of integers. The set {1, 2, 3} is an element of S, as 
is the set O of odd integers and the set A = {n € Z|n < 10}. The set B = 
{n € Z|n = 5}, on the other hand, is not in S. Note also that the integer 
1 is not in S, but {1} is in S. 


Power Set 


An important example of a set of sets is the power set of a set A. It is defined 
as follows: 


Definition 2.3.1 Let A be a set. The power set of A, written P(A), is 
{X|X CA}. 


The power set of A, then, is the set of all subsets of A. The elements of 
P(A) are themselves sets of elements of A. 


Let A = {1, 2, 3}. Then, for example, {1, 2} C A, so {1, 2} € P(A). Notice 
that 1 ¢ P(A). However, {1} is a subset of A, so {1} € P(A). 

By the same token, A € P(A) and at the other extreme, © € P(A); that 
is, © C A. This was proven in Proposition 2.2.2. 

For completeness, here’s a listing of the whole power set of A: 


P(A) = {©, {1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}, {1, 2, 3} 


Indexing Sets 


Before introducing other examples of sets of sets, we need to introduce 
some notation. 

Suppose that we have a finite number of sets, A,, A2,...A,, all contained 
in some universal set U. The collection of such sets would be written {A,, 
Az, ...An}. The subscripts that are used to distinguish the sets in this collec- 
tion form what is called the indexing set for the collection. If we denote 
this indexing set by J, then / = {1, 2, . . . nm} and this collection can be written 
{A;|i € D. 


EXAMPLE 2 


EXAMPLE 3 


EXAMPLE 4 
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Suppose that we wanted to talk about the union of the sets in this 
collection. This would be the set A, U A, U... UA, = {x © U|x © A; for 


some i = 1, 2,... n}. A shorthand notation for this union is U Aj. 
be 
Similarly, the intersection of the sets in this collection is written 


(1) A, = {x © U| x © A, for alli = 1,2,... nh. 


Let n be a fixed positive integer and let A; = {i, i + 1} fori = 1, 2,3,..., 
n. Then {A;| i € I} = {{1, 2}, {2, 3}, ..., {n,n + 1}. Notice that this collection 
is itself a finite set, with n elements. 


n n 
We leave as an exercise the computation of U A, and (1) Aj. 
i= i= 


Now suppose that we have an infinite collection of sets {4,, A,, A3,...} 
where the indexing set is the set of all positive integers. Then for the union 
and intersection we write 


VAHRs U|x € A; for some i € Z*} 
and 
[A= ax © U|x € A;for alli € Z*}. 
For each positive integer i, let A; = (—i, i) = {x € R| -i < x < i}. Then 


the set S = {A,, Ay, A3,...} is a collection of open intervals on the real line. 


The union of the elements in S is U (—i, i) = R and the intersection is 
b= 


a (—i, i) = (-1, 1). The fact that the union is R is a consequence of the 
Archimedean Principle discussed in Section 1.2. We leave the details as 
an exercise. 


Notice that the sets A; in the previous example have a very interesting 
property: A, C A, € A; C.... That is, for all i, j © Z*, if i sj, then A; C 
A;. When this happens, the collection is said to be increasing, or to be an 
ascending chain. (We can similarly define decreasing collections or descend- 
ing chains.) 

Let A; = {i,i + 1,i + 2,...}. Then L) Ap= Zt and 0 A;= @. 
To prove the assertion about the union, first note that since each A; C 


Z*, it follows that U A; © Z*. (Prove this!) Conversely, if n € Z*, then 


n €A,,. This implies that n € U A,. Thus U A; = Z. 
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To prove that O A; = ©, we need merely show that, for any n € Z*,n 


is not in every A; or equivalently, n € A; for some i. But n € A,.:. So the 
proof is complete. 
Is the collection in this example increasing, decreasing, or neither? 


Not all infinite collections of sets can be indexed by the positive integers. 
Here is an example. 


If r is a real number, let A, = (r, ©) = {x © R| x > r}. Here the indexing 
set is the set R of real numbers. This collection is decreasing: if r < s, then 
(r, ©) D (s, ©). (As you might expect, A D B just means B C A.) 

If we want to express the union and intersection of the sets in this 
collection, the notation of Example 3 will not work. In this case, we express 


the union as U A, and the intersection as a A,. 
rr rr 


In general, we have the following notation. 


Definition 2.3.2 Let J be a set, and {A; | i € 7} a collection of sets. 
The union of the collection, denoted U A;, is {a | a € A;, for some 
ic Ih. 
The intersection of the collection, denoted a A;, is {a | a € A;, for 
alli € 7}. 


Sometimes we may wish to discuss a collection of sets without actually 
specifying a particular indexing set. We may simply say: let S be a collection 


of sets, and to denote the union and intersection we write UA and a) A. 


An example of this notation appears next in the definition of partition, 
a concept that will be important for us when we discuss equivalence relations 
in Chapter 4. 


Partitions 


Definition 2.3.3 Let A be a set. A partition of A is a subset Pof P(A) 
such that 


1. if X € Athen X # ©. 
2 Ae = AS 
XEP 
3. if X,Y € Pand X # Y, then XN Y=©. 


EXAMPLE 6 


EXAMPLE 7 


Theorem 2.3.4 


Corollary 2.3.5 


Corollary 2.3.6 
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That is, a partition of a set divides the set into different nonempty subsets 
so that every element of the set is in one of the subsets and no element is 
in more than one. 


Let A = {1,2,... 10}. Then 7 = {{1, 4, 9}, {2, 3, 5}, {6}, {7, 8, 10}} is a parti- 
tion of A but {{1, 4, 9}, {2, 3, 5}, {6}, {7, 8, 9, 10}} is not. 


Forn € Z, let [n,n +1) ={x ER|nsx<n+ 1}. Then Z= {n,n + 
1)|n € Z} is a partition of R. 


The Pigeonhole Principle 


Recall that if A is a finite set, then |A|, the cardinality of A, is the number 
of elements in A. The following result, that the cardinality of the union of 
two finite disjoint sets is the sum of the cardinalities of the two sets, may 
seem obvious, as it should, but a rigorous proof requires a more formal 
definition of what it means for a set to be finite than we have presented 
here, so we simply state the result. 


Let A and B be finite disjoint sets. Then |A U B| = |A| + |BI. 


The following corollary is an extension of Theorem 2.3.4 to any finite 
collection, A,, A2,...A,, of finite mutually disjoint sets. By mutually disjoint, 
we mean that A; N A; = © if i # j. The proof requires the Principle of 
Mathematical Induction and is given as an exercise in Chapter 5. 


Let A,, A), ... A, be a collection of finite mutually disjoint sets. Then 


IY a ry > |A]l. 


Corollary 2.3.5 can be used to give the following generalization of Theo- 
rem 2.3.4. We leave the proof as an exercise. 


Let A and B be finite sets. Then |A U B| = |A| + |B] — |A 2M Bl. 


We can now discuss the Pigeonhole Principle. In its simplest form, the 
Pigeonhole Principle says that if n objects are placed in k containers and 
n > k, then at least one container will have more than one object in it. A 
set theory version of the Pigeonhole Principle is the following: Let Aj, 


A2,...A, be a collection of finite mutually disjoint sets. Let A = U A,. If 
|A| = k and k > na, then, for some i, |A|| = 2. 
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We will show that in any list of four positive integers, at least two will have 
the same remainder when divided by 3. 

There are three possible remainders when a number is divided by 3; 
namely, 0, 1, or 2. Since there are four numbers in our list and only three 
remainders, at least two of the numbers in the list will have the same re- 
mainder. 


A generalization of the Pigeonhole Principle is the following theorem, 
which is a consequence of Theorem 2.3.4. The proof is left as an exercise. 


Let A,, Az, ...A, be a collection of finite mutually disjoint sets. Let A = 


U A;. Suppose that |A| > ar for some positive integer r. Then, for some i, 
iz 


|A| =r+1. 


Suppose that a certain small town has more than 30,000 people on its voting 
list. Each voter is classified as Democrat, Republican, or Independent. Prove 
that at least one of these classifications must contain more than 10,000 people. 

If we let A; be the set of Democrats on the list, A, the set of Republicans, 
A; the set of Independents, and A = A, U A, U As, then |A| > (3)(10,000). 
So by Theorem 2.3.7, for some i = 1, 2, 3, |A;] = 10,001. 


eae MATHEMATICAL PERSPECTIVE: AN UNUSUAL SET 


In the previous section, we mentioned the mathematician Georg Cantor in 
connection with a rigorous definition of the rational numbers. Cantor is also 
known for defining a set of real numbers with some interesting properties. 
This set has become known as the Cantor set. 

To define the Cantor set, we first define a descending chain of sets 
as follows: 

Let A, be the closed interval [0, 1]. 

We form the set A, by removing from [0, 1] its middle third; namely, the 
open interval (1/3, 2/3). Thus A, = [0, 1/3] U [2/3, 1]. 

We define A, by removing the middle thirds from each of the closed 
intervals that make up A;. So A, = [0, 1/9] U [2/9, 1/3] U [2/3, 7/9] U [8/9, 1]. 

We continue in this way to form sets A, by removing the middle thirds 
from each of the closed intervals that make up the previous set A,_;. 

Note that each A, is the union of 2" closed intervals, each of which has 
length 1/3", and that the sets A, form a descending chain. 


Finally, we define the Cantor set C by C = 0 A,. 


The Cantor set is difficult to visualize and impossible to draw with com- 
plete accuracy because its construction involves an infinite process. Even 
determining what real numbers are in the set C is not so easy. Certainly, at 
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least, the endpoints of every closed interval in each of the sets A, are in C: 
0, 1, 1/3, 2/3, 1/9, 2/9, 7/9, 8/9, .... 

A good way of determining whether or not a given real number in the 
interval [0, 1] is in C is to write the number in ternary form. The decimal 
representation of a number x in [0, 1] is of course a representation of the form 


a, Qa as 
X= O.aa203...= 75 + TE ig 
where the numbers a, @, a3, ... are integers between 0 and 9 inclusive. 
The ternary expansion of x has the form 
b, by. by; 
x =O.bib2bs...- Zt atat.-: 
where b,, b2, b3,... are from among the integers 0, 1, and 2. This representa- 


tion is unique except for numbers such as 1/3 that can be represented as 
0.100000 ... ending in a string of 0’s and also by 0.022222 ... ending in a 
string of 2’s. We will adopt the convention that the representation using a 
string of 2’s will be used instead of the one using a string of 0’s. 

As an example, we will compute the ternary representation of the number 
5/7. By simple calculation, one can show that: 2/3 < 5/7 <1, 6/99 <S/7 << 
7/9, 19/27 < S/7 < 20/27, 57/81 < S/7 < 58/81, 173/243 < 5/7 < 174/243, ... 
and so on. Thus 2/3 + 0/9 + 1/27 + 0/81 + 2/243 < 5/7 < 2/3 + 0/9 + 
1/27 + 1/81. So we get 5/7 = 0.201021201021 ... in ternary form. 

Now how do we determine whether 5/7 is in C? Note that because 
2/3 < 5/7 < 1, 5/7 is in A;. Because 6/9 < 5/7 < 7/9, 5/7 is in A,. But the 
middle third of the interval [6/9, 7/9] is (19/27, 20/27) and 5/7 is in that 
interval. Therefore 5/7 is not in A; and therefore is not in C. 

The fact that 5/7 is in the interval (19/27, 20/27) is reflected in the fact 
that the ternary expansion of 5/7 has a 1 in the third place. So in fact a 
number x will not be in C if a 1 appears at any point in its ternary expansion. 
Even more strongly, x € C if and only if the ternary expansion of x consists 
of just 0’s and 2’s. 

Let x = 0.020202 ... in ternary form where the altemating 0’s and 2’s 
continue indefinitely. Then x is in the Cantor set C. To see what number 
this ternary expansion represents, we write 


Now this is just the geometric series 5) 2/9", which converges to 1/4. So 


n=l 
1/4 is in C. 
One might ask just how “large” is the Cantor set. It is certainly an 
infinite set and also a bounded set because it is contained in the interval 
[0, 1]. We can measure the size of any subset of [0, 1] that is an interval 
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or is a union of a finite number of disjoint intervals by just adding the 
lengths of each subinterval. For example, the set S = [1/5, 3/7] U [5/8, 
9/11] has size 1649/3080. 

Even more strongly, the size of each A, is 2"/3" = (2/3)". Because C is 
the intersection of all A,’s, it must be contained in each one. Hence C has 
“size” less than (2/3)" for all positive integers n. But (2/3)" approaches 0 as 
n approaches ©. So if the Cantor set has size in any meaningful way, it must 
have size 0. 

On the other hand, it is possible to show that, in a sense that will 
be made precise in Chapter 6, C has “more”? elements than the set Q 
of all rational numbers. This is one of the paradoxical features of the 
Cantor set. In one sense it seems small and in another sense it appears 
quite large. 


Exercises 2.3 


1. Compute P(A), where 
(a) A = {4, 7, 10}. 
(b) A = {0, 2, 4, 6}. 


2. Write four elements of P(Z). 
3. Compute P(P(A)), where A = {0, 1}. 


4. Let A = {0, 1, 2, 3, 4, 5}. 
(a) Write 4 subsets of A. 
(b) Write 3 elements of P(A). 
(c) Write 3 subsets of P(A). 
(d) Write 3 partitions of A. 


5. Let A = {0, 1, 2}. Characterize the following statements as true or false. 
(a) {0} C P(A). 
(b) {1, 2} € P(A). 
(c) {{0, 1}, {1}} C P(A). 
(d) @ € P(A). 
(e) OC P(A). 
(f) {2} € P(A). 
(g) {D} C P(A). 


6. Let A and B be sets contained in a universal set U. 
(a) Prove that if A C B, then P(A) C P(B). 
(b) State the converse of part (a). Prove or disprove. 


7. Let n be a fixed positive integer and let A; = {i, i + 1} for i = 1, 2, 


3,...n as in Example 2. Compute U A; and OM A;. Prove your answer. 
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13. 


14. 


15. 


16. 


17. 
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. Let A; = (i, 1) = {x € R| -i < x < i} as in Example 3. Prove that 


U (—i, i) = Rand a (-i, i) = (-1, 1). (See Exercise 12 of Section 1.1.) 


. Let S be a set and let {A; | i € I} be a collection of subsets of S. Prove 


that UA, CS. 
iel 


Let A; = {1, 2,3, ... i} for i € Z*. Compute U A, and a A;. Prove 
your answer. 
Let A; = [i,i + 1) = {fx © R|isx<i-+ 1} fori € Z*. Compute 


U A; and 0 A;. Prove your answer. 
i= i= 


Let A; = (1/i, i] = {x € R| 1/i < x Si} for i = 2. Compute U A; and 


C) A;. Prove your answer. 
i= 


Let A; = [1, 1 + 1/i] for i € Z*. Compute U A; and OM Aj. 


Let A; = (1,1 + 1/1) for i € Z*. Compute U A; and a A;. 


Which of the collections of sets in Exercises 10-14 are increasing chains 
and which are decreasing chains? 


Let {A;, A2, A;, ...} be an infinite collection of sets indexed by the 
positive integers. Assume each A; is contained in a universal set U and 
that A; is the complement of A; in U. Prove that 


(a) U 4, = 1) A, 


b Tore a 
m Ma=U 4, 
(a) Prove that if the collection {A; | i € Z*} is decreasing, then 
LAP Ags 
iez* 
(b) Give an example of a decreasing chain {A; |i € Z*} with | A, # ©. 
iez* 
(a) Prove that if the collection {A; | i € Z*} is increasing, then 
) A; = A,. 


iez* 
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19. 


27. 


28. 


(b) Give an example of an increasing chain {A; | i € Z*} with A; C Z* 
and U A; ~ Z*. 


i€Z 


Prove that U {fj} = 1. 


. Write two partitions each of Z and R. 
21. 


Let A be a set and let S be a subset of P(A). Complete the following 
statement: “‘S is not a partition of A if... .” 


. Let Eand O denote the sets of even and odd integers respectively. Prove 


that A = {E, O} is a partition of Z. 


. Let A = 3Z,B ={n © Z|n = 3t + 1 for some t € Z} and C = {ne 


Z | n = 3t + 2 for some t € Z}. Prove that {A, B, C} is a partition of Z. 


. Prove Corollary 2.3.6. 
. Prove Theorem 2.3.7. 


. Use the Pigeonhole Principle to prove the following: Let S be a square 


whose sides have length 1. Let P,;, P., P3, Py, Ps be five points that lie 
inside S. Prove that the distance between at least two of the points 
is less than V2/2. 


The following numbers are given in ternary form. Find the real number 
that each represents. Which ones are in the Cantor set C? 

(a) x = 0.002002002 ... (with 002 repeated indefinitely). 

(b) x = 0.011011011 ... (with 011 repeated indefinitely). 

(c) x = 0.0010000000 ... (with 0’s repeated indefinitely). 


Determine whether the following numbers are in C: 
(a) 3/5 

(b) 37/50 

(c) 1/40 

Hint: 1/40 = 2/(3* — 1). 


Discussion and Discovery Exercises 


D1. Even for sets with a small number of elements, the power set can be 


fairly large. (See Exercise 1(b).) Conjecture the number of elements in 
the power set of a set with n elements. Try to prove your conjecture. 


D2. Let A and B be sets contained in a universal set U. 


(a) What can be said about the power set of A U B in relation to 
P(A) U P(B)? For example, are they equal sets? State and prove 
some results along these lines. 

(b) Compare the power set of A M B with P(A) M P(B). State and 
prove any results you can. 


3.1 


FUNCTIONS 


DEFINITION AND Basic PROPERTIES 


The idea of a function is everywhere in such mathematical subjects as trigo- 
nometry, analytic geometry, and, of course, calculus. These courses deal 
almost exclusively with real numbers only, so you are no doubt used to this 
type of expression: 


f(x) =x? or f(x) = Vx or f(x) = cosx. 


But we will be dealing with many other sets, and so we will need to 
generalize what we mean by the word “‘function.” 

You probably think of a function as a formula that allows you to take a 
number and compute a value; in the first example above, you just multiply 
the number by itself. But what’s important is not that you do (or even could 
do) some algebraic manipulation, but rather that you can find the value; 
better yet, that there simply is a single value associated with, or assigned to, 
a given object. In this spirit, we define a function between arbitrary sets. 


Definition 3.1.1 Let A and B be nonempty sets. A function f from A 
to B is a rule that assigns to each element in the set A one and only 
one element b in the set B. 

We call A the domain of f and B the codomain; we write f: A — B, 
and for each a € A, we write f(a) = b if b is assigned to a. 
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The idea of a function can be suggested by the following simple diagram: 


How we find f(a) is not important; what matters is that it exists. Sometimes 
we say that f maps a to f(a) or a is mapped to f(a) by f. 

Note that in defining a function f, we may simply state the domain and 
codomain of f and specify what f(x) is for any x in the domain. This specifica- 
tion may be done by one or more formulas or by some verbal description. 

From now on, throughout the text, whenever we discuss a function f: 
A — B, it will be understood, if not explicitly stated, that A and B are 
nonempty sets. 


Let A = ({1, 6, 12}, B = {0, 2, 12}. Define f: A > B by f(1) = 2, f(6) = 0, 
and f(12) = 12. Note that there need not be any specific “‘formula’’ for f, 
but since the domain has only three elements it is easy to describe the 
assignment of an element of B to each element of A. Of course, there are 
other functions from A to B. 


With A and B as in the previous example, let g: A — B be defined by 
g(1) = 0, g(6) = 0, and g(12) = 0. Notice that distinct elements of the 
domain may be assigned the same element of the codomain. In this example, 
g maps every element of A to 0. 


For any nonempty set A, we can define a very simple function i,: A > A, 
called the identity function on A, by i,(a) = a Va € A,; that is, i, maps every 
element of A to itself. 


Let a, b € R, and define f: R > R by f(x) = ax + b. This is called a linear 
function (you may have recognized the form from analytic geometry); it is 
constant (that is, f(x) = b Wx € R) if a = 0 and nonconstant otherwise. 


Let f: R > R be defined by f(x) = x’. The domain of fis, of course, R, and 
the codomain of fis also R. But not every real number is an assigned value 
by means of this function; only the nonnegative ones are. That is to say, the 
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codomain is in some sense “‘too big.’’ For this reason, we single out a subset 
of the codomain, by making the following definition. 


Image of a Function 


Definition 3.1.2 Let f: A — B be a function. The image of the function 
fis Im f = {y € B| y = f(x) for some x € A}. 
More generally, if X C A, we define 


f(X) ={y € B| y = f(x) for some x € X}. 
f(X) is called the image of X under f. (Note that the image of f is 


just f(A).) 
The graph of f is the set {(a, b) € A X B| b = f(a)h. 


Note: A consequence of this definition is that for any function f, f(@) = ©. Do you 
see why? 


As above, we can suggest the definition of Im f with a simple diagram: 


Similarly, for subsets X C A, the definition of f(X) leads us to this di- 
agram: 
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The image of a function f, then, is always a subset of the codomain. In 
fact, if we have a function f: A — B, it is always possible to consider a very 
much related function g: A — Im f, which we would define by g(x) = f(x), 
Vx € A. A subtle point here is that if Im f # B, then, despite how much 
they look alike, f and g really are different functions because they have 
different codomains. 

So, it is important to define just what we mean by two functions being 
equal. 


Definition 3.1.3 Two functions f and g are equal if they have the same 
domain and the same codomain and if f(a) = g(a) for all ain the domain. 


Let’s examine these ideas with a few more examples. 


Let A = {1, 6, 12}, B = {0, 2, 12}, and let f be defined as in Example 1. Then 
the image of fis {0, 2, 12}, the same as the codomain. If X = {1, 12}, then 


f(X) = {2, 12}. 


Let g be as defined in Example 2. Then Im g = {0}. Here the image is not 
the same as the codomain. 


Consider the ordinary sine function from trigonometry, which we can express 
as sin: R — R. The codomain of this function is R, but we know that all 
sine values are between —1 and 1 and that every real number between 
—1 and 1 is the sine of some number; that is, Im sin = [—1, 1]. In fact, 
sin([—7, 7]) = [—1, 1]. 


Define f: R — Z by the following rule: f(x) is the greatest integer less than 
or equal to x. For obvious reasons, f is called the greatest integer function, 
and we write f(x) = [x], so that, for example, [6.5] = 6, [7] = 3, [-1.2] = 
—2, and [2] = 2. 

In this case, the image of f is the entire codomain, since any integer is 
the greatest integer less than or equal to, among other things, itself. 

Note that the image of Z under f is also the codomain Z. 


We now consider some examples in which we give formal proofs that 
the image of a given function is a certain set. Proofs like these are not always 
easy. First of all, it is important to understand exactly what it means for an 
element of the codomain of a function to be in the image of that function. 
Specifically, if f: A — B is a function and y € B, then y € Im fif and only 
if there exists an element x in A such that f(x) = y. In some cases we will 
actually solve for x in terms of y. In other cases it is difficult, if not impossible, 
to solve for x and we merely show that x exists. 
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Let f: R — R be defined by f(x) = x’. We claim that Im f = S where S = 
{fy ER] y = Of. 
Clearly, Im f C S, because the square of any real number is nonnegative. 
Conversely, suppose that y € S. Letting x = Vy, it follows that f(x) = 
x? = (Vy)? = y, so that y € Im f. Therefore, S C Im f and we have Im f= S. 


In Example 10, we assumed some well-known properties of the real 
numbers; namely, that the square of any real number is nonnegative and 
that every nonnegative real number has a square root. We will continue to 
assume these and other properties of the real numbers throughout this 
chapter. For proofs, see Chapter 7. 


Let f: R > R be defined by f(x) = 3x + 7. You may recognize this function 
as a linear function whose graph is a straight line of slope 3. You could 
conclude from the graph that the image of fis R. Here is a proof. 

Clearly, Im f C R. Let y € R. We must find an x in R such that f(x) = 
y. It is easy to find x. Just take the equation y = 3x + 7 and solve for x in 
terms of y. We get x = (y — 7)/3. We leave it to the reader to check that 
f(x) = y. It now follows that Im f = R. 


Let f(x) = 3x + 7 as in the previous example. We will compute f(X) where 
X is the closed interval [0, 1]. A look at the graph of f will lead us to conclude 
that f(X) = [7, 10]. 
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We now give a rigorous proof. 

As in the previous example, we are proving that two sets are equal. So 
we will prove that each set is a subset of the other. 

First, let y € f(X). Then y = f(a) for some a € [0, 1]. Because 0 Sa = 
1, it follows from properties of inequalities that 7 = 3a + 7 = 10. Hence 
y € [7, 10]. Therefore, f(X) C [7, 10]. 

Now let y € [7, 10]. We know from the previous example that f(x) = y 
where x = (y — 7)/3. Thus to show that y € f(X), it is sufficient to show 
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that x € [0, 1]. But since 7 = y = 10, it follows easily that 0 = (y — 7)/3 S 
1. Hence [7, 10] C f(X). We have now proved that f(X) = [7, 10]. 


Let f: Z — Z be defined by 


( jee ifniseven 
TY nia: Stated: 


We will compute f(E) where E is the set of even integers. From the 
definition of f, it seems reasonable to expect that f(E) = O where O is the 
set of odd integers. We will now prove it. Remember. To prove two sets 
equal, we usually prove that each is a subset of the other. 

First, let n € E. Then f(n) =n + 1 and since n is even, n + 1 is odd. 
Thus f(n) € O, proving that f(E) C O. 

Now let m € O. To prove that m € f(E), we must find ann € E such 
that f(m) = m. Well, clearly m — 1 is even and f(m — 1) =m—1+1= 
m. This proves that m € f(E) and hence O C f(E). It follows that f(E) = O. 


Let f be the function of the previous example. In this example, we will 
compute f(X), where 


X={n €Z|n=4t+ 1 for some t € Z}. 


A good way to start is to try some values. Some elements in X are: —3, 
1,5, and 9, and f(—3) = —6, f(1) = —2, f(5) = 2, f(9) = 6. Each one 
of these values is two more than a multiple of 4, so we can reasonably 
conjecture that 


f(X) ={n © Z| n = 4t + 2 for some t € Z}. 


Now let’s prove it. 

We’ll denote the set {n € Z|n = 4t + 2 for some t € Z} by S and 
prove that f(X) = S. Letn € X. Then n = 4¢ + 1 for some integer ¢. Son 
is odd and 


f(n) = f(4t+ 1) = 4¢+1-3=4t-2=4(t- 1) +2. 


We see that f(n) is in S since it is 2 more than a multiple of 4. Thus 
F(X) CS. 

Let m € S. Then m = 4t + 2 forsome tin Z. We want to find an element 
n © X such that f(n) = m. Since m + 3 is odd and f(m + 3) = m + 
3 — 3 = m, it makes sense to let n = m + 3. To complete the proof, we 
must show that m + 3 © X. But 


m+3=4t+2+3=4+5=4¢+1)+1EX. 
So S C f(X) and we conclude that f(X) = S. 


For some functions, computing their images can be difficult and the 
methods we used for the previous examples will not suffice. Sometimes the 
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methods of calculus can be used to great advantage. In the next two examples, 
we will use the Intermediate Value Theorem. We state it here without proof, 
because a formal proof belongs in an introductory text in real analysis or 
advanced calculus. 


Intermediate Value Theorem Let f be a function whose domain and codo- 
main are subsets of R. Assume that f is continuous on the closed interval 
[a, b]. If y is any number between f(a) and f(b), then there is a real number 
x in [a, b] such that f(x) = y. 


Consider the function f: R > R defined by f(x) = x? + 4x + 1. The first 
derivative of f(x) is the function f’(x) = 3x’ + 4, which is always positive. 
Thus f(x) is an increasing function and has no relative max or min points. 
We also see that lim,_,.. f(x) = © and lim,.,-.. f(x) = —%. Since fis continuous, 
the Intermediate Value Theorem tells us that any real number is in the 
image of f. We can see this more clearly by looking at the graph of f: 


fx) =x34+ 4x41 
40 


20 


-3 —2 -1 0 1 2 3 


Here is a formal proof that Im f = R. Since Imf C R, it suffices to show 
that if y € R, then dx CR 5B f(x) = y. 

Let y € R. The fact that lim,.,.. f(x) = © essentially means that the 
function f will eventually surpass any given number if x is taken large enough; 
in particular, it will eventually surpass y. Thus there is a real number M > 
0 such that f(x) > y whenever x > M. Similarly, since lim,-. f(x) = —™, 
there is a real number N < 0 such that f(x) < y whenever x < N. So there 
exist real numbers x, and x, where x, < x, such that f(x,;) < y < f(x). Since 
f is continuous everywhere, it is in particular continuous on the interval 
[x:, X,]. Then by the Intermediate Value Theorem there exists x € [x1, x2] 
such that f(x) = y. 


Let f be the function of the previous example. We will now compute f(X) 
where X is the closed interval [0, 1]. Since f(0) = 1 and f(1) = 6, it follows 
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from the Intermediate Value Theorem and the fact that f is increasing that 
f(X) = [1, 6]. We leave the formal proof as an exercise. 


Let f: R = R be defined by f(x) = x? — 3x? + 1. Let’s compute f(X) where 
X is the closed interval [1, 3]. Since f(1) = —1 and f(3) = 1, it may seem 
that f(X) = [-1, 1]. However, fis not an increasing or a decreasing function 
on [1, 3]. In fact f has a local minimum at x = 2 and f(2) = —3. We claim 
that f(X) = [-3, 1]. 

We first draw the graph of f. 


fx) = 33 — 3x2 +1 


1 1.5 2 2.5 3 


It is easy to see from this graph why f(X) = [-3, 1]. We now give a 
formal proof. First note that f is decreasing on [0, 2] and increasing on 
[2, ©]. Let x € [1, 3]. If 1 sx S2, then —3 = f(2) S$ f(x) S f(1) = —-1. So 
f(x) € [-3, 1]. If 2 = x $3, then —3 = f(2) S$ f(x) S f(G) = 1. Again we 
have that f(x) € [-3, 1]. This proves that f(X) C [-3, 1]. Now let y € [-3, 
1]. Since y is between f(2) and f(3) and f is continuous, by the Intermediate 
Value Theorem there exists x € [2, 3] such that f(x) = y. Hence y € 
f({2, 3]) and since [2, 3] C [1, 3], we get that y © f({1, 3]). Therefore, 
[-3, 1] C f([1, 3]). Hence f(x) = [-3, 1]. 


If f: A > B is a function and if X and Y are subsets of A such that X is 
a subset of Y, it seems reasonable that f(X’) will be a subset of f(Y). We 
prove that result now. 


Let A and B be sets and let f: A — B be a function. Let X and Y be subsets 
of A. If X C Y, then f(X) C f(Y). 


PROOF: To prove that f(X) C f(Y), we must take an arbitrary element in 
f(X) and prove that it is in f(Y). So we let y € f(X). Then y = f(x) for 
some x € X. Since X C Y, it follows that x € Y and therefore y € f(Y). 
This proves that f(X) C f(Y). @ 


Let’s take a moment to analyze the proof of Proposition 3.1.5. The 
statement of the proposition is of the form P = Q where P is the statement 
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X C Y and Q is the statement f(X) C f(Y). It is statement Q that we want 
to prove, so we ignore statement P for the moment. To prove statement Q, 
we have to prove that one set is a subset of another set. The usual way of 
doing this is to take an arbitrary element of the first set and prove that it is 
in the second set. That is why in the proof of Proposition 3.1.5, we start by 
taking y € f(X). Now what does it mean for y to be an element of f(X)? 
By the definition of f(X), y must be the function f evaluated at an element 
of X. That is why we write y = f(x) for some x € X. We then have to ask 
ourselves what we need to do in order to show that y is in f(Y). Well, by 
the definition of f(Y), we need to show that y is the function f evaluated at 
an element of Y. Since y = f(x), we will be done if we can show that 
x € Y. This is where statement P comes in. The fact that we are assuming 
P to be a true statement means that since x is in X, x is also in Y. Finally 
we can conclude that y € f(Y), proving that f(X) C f(Y). 

In proving a theorem of the form P => Q, then, it is often advantageous 
to focus first on statement Q and develop a strategy for proving it. In devel- 
oping this strategy you need to be mindful of the fact that statement P will 
be used in the proof. 


The converse of Proposition 3.1.5 is false. (See Exercise 23 following this 
section.) The reason that it is false boils down to the fact that if f(a) € f(X) 
for some a € A, it does not necessarily follow that a © X. For example, if 
f: R— R is defined by f(x) = x’ and X is the set of positive real numbers, 
then f(—1) € f(X) since f(—1) = f(1) and 1 € X, but -1 € X. 

The following result tells us how the union and intersection of sets are 
affected when we take their image under a mapping. 


Let A and B be sets and X and Y subsets of A. Let f: A — B be a func- 
tion. Then 


1. f(XU Y) = f(X) U f(Y). 
2 f(XNY) CX) NF). 


PROOF: We will prove 1 and leave 2 as an exercise. 

Recall that if we wish to prove two sets equal, it is usually necessary to 
prove that each one is a subset of the other. 

Let b € f(X U Y). Then b = f(a) where a € X U Y. Soaisin X or Y. 
If a € X, then b € f(X) and if a € Y, then b € f(Y). 

Thus b € f(X) or b € f(Y), which means that b € f(X) U f(Y). We 
have now proved that f(X U Y) C f(X) Uf(Y). 

On the other hand, suppose that b € f(X) U f(Y). Then b € f(X) or 
be f(Y). 

If b € f(X), then b = f(x) for some x € X. Since x € X,x EX UY 
and so b € f(X U Y). Similarly, if b € f(Y), then b € f(X U Y). 

In either case, we get b © f(X U Y) and therefore f(X) U f(Y) C 
F(X U Y). 

It now follows that f(X U Y) = f(X) Uf(Y). @ 
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Note that we do not have strict equality of sets in statement 2 of Proposi- 
tion 3.1.6. One can give examples of functions where f(X N Y) 4 f(X) N 
f(Y). See Exercise 15 following. 


Inverse Image 


Besides computing the image of a subset of the domain of a function, we 
can also talk about what is known as the inverse image of a subset of the 
codomain. It is defined as follows. 


Definition 3.1.7 Let f: A — B be a function and let W be a subset 
of B. Then the inverse image of W with respect to f is the set {a € A | 
f(a) © W}. We denote this set by f-'(W). 


+= 


© 


So f-'(W) is the set of elements of the domain of f that gets mapped to 
an element of W. 

It is important to realize that in this context the symbol f~! does not 
refer to the inverse of the function f. Such an inverse may not even exist! 
Inverse functions are discussed in the next section. 


Let A = {0, 1, 2, 3, 4,5} and B = {7, 9, 11, 12, 13}. Let f: A — B be defined 
by (0) = 11, F() = 9, f(2) = 7, f3) = 9, f(4) = 11, and f(5) = 9. 

Then, for example, f~'({7, 9}) is the set of all integers in A that get 
mapped to 7 or 9 by f. 

Since f(1) = 9, f(2) = 7, f(3) = 9, and f(5) = 9, it follows that 
"CT 9) = {8 2, 3, Sh. 

Similarly, f-1({11, 12}) = {0, 4} and f7'({12, 13}) = ©. 


Let f: R — R be defined by f(x) = 4x + 5. Let W = {x © R| x > 0}. Then 
f(W) is the set of all real numbers x such that f(x) € W. By the definition 
of W, f(x) © W if and only if f(x) > 0. Thus 


f(W) = {x © R| 4x + 5 > 0} = (-5/4, &). 
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If Z is the open interval (9, 13), then 
f\(Z) = {x ER|9 < 4x + 5 < 13} = (1, 2). 


Let f: Z — Z be defined by 


if if nis even 
f(n) = 2’ 
2n + 4, if nis odd. 


It is easy to see by computation that f~'({6, 7}) = {1, 12, 14}, for example. 

Now let’s compute f~'(O) where O is the set of odd integers. We must 
find all integers n that get mapped by f to an odd integer. 

Since 2” + 4 is always even, we see that no odd integer gets mapped to 
an odd integer. On the other hand, some even integers get mapped to odd 
integers and some do not. For example, 2, 6, and 10 get mapped to odd 
integers and 4, 8, and 12 do not. It seems that those even integers that are 
not multiples of 4 get mapped to odd integers and the multiples of 4 get 
mapped to even integers. 

An even integer that is not a multiple of 4 can be expressed in the form 
4t + 2 where ¢ is any integer. So we will prove 


f-\(O) = {n € Z| n = 4t + 2 for some t € Z}. 


First, let n € Z such that n = 4t + 2 for some t € Z. Then f(n) = 
f(4t + 2) = 2r +1 © O. Hence n € f-'(O). So we have {n € Z| n = 4t + 
2 for some t € Z} C f-'(O). 

Conversely, suppose that n € f-1(O). Then f(n) is odd. It follows that 
f(n) # 2n + 4, so n is even. Thus f(n) = n/2 and since f(n) is odd, f(n) = 
2t + 1 for some t € Z, giving us n = 4t + 2. 

Consequently, f-'(O) C {n € Z| n = 4t + 2 for some t € Z} and we 
may now conclude that 


f-\(O) = {n € Z|n = 4t + 2 for some t € Z}. 


Analogous to Proposition 3.1.6 is the following result on the inverse 
images of unions and intersections. 


Let f: A > B be a function. Let W and Z be subsets of B. Then 


1. f-\(W U Z) = f-(W) U F(Z) 
2. f-(W A Z) = f-(W) n f-(Z) 


PROOF: 1. As in the proof of Proposition 3.1.6, we wish to prove that two 
sets are equal and therefore we will prove that each is a subset of the other. 
First, let y € f '(W U Z). Then f(y) © W U Z, which means that f(y) € 
W or f(y) € Z. If f(y) € W, then y € f-\(W) or if f(y) € Z, then y € 
f-(Z). Thus y © f-\(W) U f-Z) and we have f'\(W U Z) C 
fW) UFZ). 
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Conversely, suppose that y © f-'\(W) U f-(Z). Then y € f-'(W) or 
y Ef (Z). If y € f-'(W), then f(y) € W or if y € f-(Z), then f(y) € Z. 
In either case, f(y) € W U Z, which implies that y € f-'(W U Z). We have 
now proved that f'(W) U f-(Z) Cf (WU Z). 

We can now conclude that f'\(W U Z) = f-'(W) U f-\Z). 

2. The proof of 2 is left as an exercise. © 


We end this section with a result on functions whose domains are finite sets. 


Let A and B be sets and let f: A — B be a function. Assume that A is a 
finite set. Then Im f is a finite set and |Im f| S |A|. 


PROOF: Let A = {a;, a, ... a,}. Then the image of f consists of the distinct 
elements from among f(a), f(a2), ... f(a,). Hence Im f is a finite set with 
at most n elements. « * 


a» HISTORICAL COMMENTS: THE IDEA OF A FUNCTION 


The very general, abstract definition of function given in this section is 
probably not the view that you learned in previous mathematics courses, 
and it is certainly not the way that mathematicians have always approached 
the idea. For example, as calculus was being developed in the 17th century, 
geometric objects, curves, were the focus of attention. Newton and Leibniz 
worked by attaching other geometric objects, such as tangent lines, to curves; 
they did not deal with equations of curves as we do today. 

It was not until the 18th century that Leonhard Euler (1707-1783) began 
the shift to the formulas that you are so familiar with from high school 
algebra. His ‘“‘analytic expressions” encompassed more than polynomials 
and rational functions: he studied combinations of these and trigonometric, 
logarithmic, exponential, and power functions (such as roots), as well as 
their derivatives and integrals. Importantly, he even allowed infinite series 
of these functions. Euler’s monumental ‘“‘calculus book,” the [ntroductio in 
Analysin Infinitorum, published in 1748, emphasizes the idea of a function 
as an algebraic formula by not including any art or graphs. 

For a time, mathematicians hoped that all functions could be represented 
by formulas. In 1822, J. B. J. Fourier (1768-1830) even attempted to prove 
that any function f: (—a, a) — R can be written as an infinite sum of 
trigonometric functions (in fact, as just a sum of sines and cosines). We 
understand today that Fourier’s proof of this statement is not completely 
rigorous, but his efforts led to more intense scrutiny of the concept of a 
function. Soon P. G. L. Dirichlet (1805-1859) gave a definition of function 
that is essentially the same as ours, and as a counterexample to Fourier’s 
result, defined the following remarkable function: 


0, if x is rational 


d(x) = | 


1, if x is irrational. 
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Obviously, this function is not given as an analytic expression, to use 
Euler’s term. But it is also not a geometric curve that Newton or Leibniz 
would have understood: it’s impossible to draw because it is discontinuous 
everywhere. It can only be understood in the sense of our definition, as a 
rule or a simple pairing of elements in the domain with elements in the 
codomain. (The fact that it is discontinuous everywhere is a consequence of 
the fact that between any two rational numbers there is an irrational number 
and between any two irrationals there is a rational. A proof of this fact is 
given in Chapter 7, Theorem 7.2.6.) 

Today, mathematicians universally accept our abstract notion of a func- 
tion. But there is a price: we must always suspect our intuition. The Dirichlet 
function led to the creation of many other “‘pathological’’ functions, perhaps 
culminating in 1872 when K. Weierstrass (1815-1897) gave an example of 
a function f: R — R that is continuous everywhere but does not have a 
derivative anywhere. Intuitively, its graph is unbroken but has a sharp corner 
at every point! In the words of H. Poincaré (1854-1912): ‘“‘Logic sometimes 
makes monsters.” 

The article by Israel Kleiner [11] gives much more detail about the 
historical development of the fundamental notion of function. 


Exercises 3.1 


1. Write out three functions with domain {0, 1, 2, 3, 4, 5} and codomain 
{6, 7, 8, 9, 10}. Make at least one of the functions have the property that 
its image equals its codomain. 


2. Write out three functions with domain {2, 3, 4, 5} and codomain {6, 7, 
8, 9, 10}. Explain why you cannot define a function between these two 
sets for which its image equals its codomain. 


3. For the following functions, compute first the image of f and then the 
image of the given subset X. (No formal proofs are necessary.) 
(a) f: Z— Z, f(n) = 2n + 1; X = E, the set of even integers 
(b) f: RR, f(x) = 2x + 1; X = [-1, 4] 


2n, ifmis even 
(c) f:Z>Z, f(n) = rat ;X=E 
n, if nis odd 
(d) f: R>R, f(x) = cos x; X¥ = }o.5| 
(e) f: R* > R*, where R* is the set of nonzero real numbers, f(x) = 
1 
a X = (0, 1] 


(f) f: ROR, f(x) = es X = {x ER | x > 0} 
(g) f:Z X ZZ, f(n,m) =nm, X=EXZ 
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10. 


(h) f: Z— Z X Z, f(n) = (n,n); X = E 
(i) FRX RY RB fle y) = 5 X = 1G, 1) |x © R} 


. Which of the following are functions from R to R? For those that are 


functions, find their image. (Drawing a graph will help in determining 
the image. No formal proof is necessary.) 


(a) f(x) = tVx 


: -{ t. aix=0 
aa et ee 

aa ifx=1 
2 1 an ae 
: af ffx 0 
TONS ake ign 

“ar ifx=2 
(e) fe) = Ge, Beez 


. Let f: R— R be defined by f(x) = 6x + 5. 


(a) Prove that Im f = R. 
(b) Compute f([1, 4]). Prove your answer. 


. Let f: Z — Z be defined by f(n) = 2n + 1. What is wrong with the 


following proof that Im f = Z? 


Let y € Z. Then (2) = (24) +1=y. Hence y € Imf. 


Therefore, Z C Im f. Since Im f C Z by definition, it follows that 
Im f = Z. 


. Let A and B be sets and let f: A — B be a function. Complete the 


following statement: An element b in B is not in Im fif.... 


. Let f: R= R be defined by f(x) = x’. 


(a) Compute f({0, 1]). Prove your answer. 
(b) Compute f({—1, 1]). Prove your answer. 


. Let f: R — Z be the greatest integer function. 


(a) Compute f([0, 1]). Prove your answer. 
(b) Compute f(Z). Prove your answer. 


n+ 2, ifnis even 
Let f: Z > Z be defined by f(n) = a Let E denote 
2n, if nis odd. 


the set of even integers and O the set of odd integers. 

(a) Prove that Im f = E. 

(b) Prove that f(E) = E. 

(c) Prove that f(O) = {n € Z| n = 4t + 2 for some t € Z}. 
(d) Let X be the set of multiples of 4. Prove that f(X) = f(O). 


3. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 
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Let f: R— R be defined by f(x) = x3 + 4x + 1 as in Example 15. Let 
X = [0,1]. Prove that f(X) = [1, 6]. (See Example 16.) 


Let f: R — R be defined by f(x) = 2x? + 3x? — 12x + 1. Compute the 
following sets. 

Hint: Draw the graph of f first. 

(a) Im f (b) F([0, 1) 


Give formal proofs to your answers in Exercise 12. 


Reminder: Each proof should involve proving that two sets are equal, as in 
Exercise 11. 


Let f: R= R be defined by f(x) = x4 + x’. 

(a) Compute the image of f. 

(b) Prove that your answer to part (a) is correct. 
(c) Compute f([-1, 2]). 

(d) Prove that your answer to part (c) is correct. 


Let A and B be sets and X and Y be subsets of A. Let f: A ~ B be 

a function. 

(a) Prove that f(X N Y) C f(X) 1 f(Y). 

(b) Give an example of sets A and B and a function f: A — B for which 
f(X NY) # f(X) N f(Y) for some subsets X and Y of A. 


Let A and B be sets and X and Y be subsets of A. Let f: A > B be 

a function. 

(a) Prove that f(X) — f(Y) C f(X — Y). 

(b) Give an example of sets A and B and a function f: A > B for which 
f(X) — f(Y) # f(X — Y) for some subsets X and Y of A. 


Let A = {1, 2, 3, 4, 5}, B = {a, b, c, d, e}, and define f: A > B by 
fQ) = a, f(2) = c, f(3) = 2 f(4) = a, and f(5) = d. Find each of 
the following. 

(a) f-'\(B) (b) f(a, c, d, e}) 

(cy: f la,.b; c;2}) (d) f-"({b}) 


For the following functions, compute the inverse image of the given 

subsets of the codomain. (No proofs are necessary.) 

(a) f: Z > Z, f(n) = 3n + 1; W, = E, the set of even integers, W, = 
{4}, W3 = {1, 5, 8}. 

(b) f: RR, f(x) = 3x + 1; W, = {4}, W. = {1, 5, 8}, W3 = (4, ©), 
W, = (2, 4), W; = Z, Ws = E, the set of even integers. 

(c) f:R —R, f(x) = cos x; W, =[-1, 1], W. = {x E R| x = 0}, W; = Z. 

(d) f: RR, f(x) = e*; W, = [-1, 0], W. = {x © R | x = 0}, W; = {1}. 

n, ifn is even 


:Z Z, = ;W, = E, W, = {1}, W; = 
ea ae ae if nis odd : 2 = {, Ws 


{6}, W, = O, the set of odd integers. 
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19. 


20. 


21. 


22) 


23. 
24. 


(f) f: R — Z, the greatest integer function; W, = {0, 1, 2}, W. = Z’*, 
Ww; = E. 
(g) f: Z—> ZX Z, f(n) = (n, n); W, = Z X {0}, W. = Z* x {-1}. 


(h) RX R* > R, fx, y) = 55 Wi = (0, We = RS Ws = br | x > Ob 


if nis even 
Let f: Z — Z be defined by f(n) = 2 

4, if n is odd. 
(a) Compute f7!({11, 12, 13, 14, 15}). 
(b) Compute f-\(E). Prove your answer. 


Let f: A — B be a function. Let W and Z be subsets of B. Prove that 
FW Z) = fw) 0 F(Z). 


Let f: A — B be a function. Let X be a subset of A. 

(a) Prove that X C f-'(f(X)). 

(b) Given an example of a function f: A — B for some A and B and a 
subset X of A such that X¥ # f-(f(X)). 


Let f: A — B bea function. Let W be a subset of B. 

(a) Prove that f(f-'(W)) C W. 

(b) Give an example of a function f: A — B for some A and B and a 
subset W of B such that f(f'(W)) # W. 


State the converse of Proposition 3.1.5 and then prove that it is false. 


Let A and B be sets and let f: A — B be a function. Is the converse of 
Proposition 3.1.9 true? Specifically, if Im fis a finite set, does it follow 
that A is a finite set? Prove your answer. 


Discussion and Discovery Exercises 


D1. 


D2. 


A real-valued function f(x) is said to have a local maximum at a point 
c if there is an open interval (a, b) in the domain of f containing the 
point c such that f(x) = f(c) for all x in (a, b). 

(a) Write the negation of this definition; that is, f(x) does not have a 
local maximum at c if.... 

(b) A standard theorem of calculus states that if a real-valued function 
f(x) has a local maximum at c, then either f’(c) = 0 or f’(c) does 
not exist. The usual proof of this result starts out by first assuming 
that f’(c) does exist and then proving that f’(c) = 0. Using appro- 
priate statements P, Q, and R and rules of logic discussed in Chapter 
1, explain why this is a valid method of proof. 


Look up the statement of the Mean Value Theorem in a calculus text. 
(a) Identify any quantifiers in the statement. 
(b) Write the negation of the Mean Value Theorem. 


3.2 


EXAMPLE | 
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(c) State a theorem that is a consequence of the Mean Value Theorem. 
Explain why it is a consequence. 


D3. (a) Write the negation of the Intermediate Value Theorem. 
(b) Give an important application of the Intermediate Value Theorem 
in calculus. 


D4. Criticize the following “proof” that if f: Z — Z is defined by f(n) = 
2n, then f(O) = {y € Z| y = 4n + 2 for some n € Z}. 
Let x € O. Then x = 2n + 1 for some n € Z. Then f(x) = 2x = 
4n + 2. Therefore, f(O) = 4n + 2. 


D5. Prove that Dirichlet’s function d(x) defined at the end of this section 
is discontinuous at every real number c. If you are unfamiliar with the 
formal e« — 6 definition of limit, then give an informal, intuitive proof. 


D6. Give an example of a function f: R — R that is discontinuous at every 
integer but continuous at every other real number. Give at least an 
informal proof to justify your example. You may want to start by drawing 
a picture of such a function and then finding a formula for it. 


SURJECTIVE AND INJECTIVE FUNCTIONS 


Surjective Functions 


In Section 3.1, we saw some examples of functions whose images equaled 
their codomains and we saw some whose images were proper subsets of 
their codomains. These examples suggest that we distinguish the notion of 
a function whose codomain and image coincide. 


Definition 3.2.1 Let f: A — B be a function. Then f is surjective (or a 
surjection) if the image of f equals the codomain of f. 


In the diagram following Definition 3.1.2, if f: A — B is surjective, the 
two ovals representing Im f and B would coincide. This leads many text 
authors (and many mathematicians) to use the word onto instead of sur- 
jective. 

Proving that a function f is surjective, then, entails proving that two sets 
are equal, a scenario we’ve encountered before. But in this case, we know 
that the image is by definition a subset of the codomain, so it suffices to 
show that the codomain is a subset of the image: Vy € B, ax € A 5 f(x) = y. 


Let A = {1, 6, 12}, B = {0, 2, 12}. Define f: A — B by f(1) = 2, f(6) = 0, 
and f(12) = 12, as in Example 1 of Section 3.1. Since the image of f is 
{0, 2, 12}, the same as the codomain, f is surjective. 
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Let g be as defined in Example 2 of Section 3.1. Then Im g = {0}, so g is 
not surjective. 


Let A be any set and let i,: A — A be the identity function as defined in 
Example 3 of Section 3.1. It is easy to see that i, is surjective because if a 
is in the codomain A, then a = i,(a), implying that a is in the image of i,. 


Let A and B be sets. Define 7;: A X B > A by 7,(a, b) = a and m: A X 
B — B by 7,(a, b) = b. These functions are called projections and are 
both surjective. (See Exercise 4.) For example, if A = B = R, then 7, maps 
a point in the Cartesian plane to its x-coordinate. 


Any nonconstant linear function f: R — R is surjective. Suppose f(x) = 
ax + b where a # 0. To prove that f is surjective, we need to show that 
R CImf. 

Let y € R. To show that y is in Im f, we need to find x € R such that 
f(x) = y. To find x, we take the equation y = f(x) = ax + 5 and solve for 
x in terms of y. We get x = (y — b)/a. Since a # 0, (y — b)/a © R and 
f(y — b)/a) = a((y — b)/a) + bb =y — b + b = y. Thus the x we needed 
to find has been found: x = (y — b)/a. This proves that f is surjective. 


A reminder of the nature of this last proof: we were proving a statement 
that begins with the existential quantifier, 4. Therefore, all we had to do 
was present the element we needed. How we found it (obviously we just 
solved a linear equation for it) might be interesting or illuminating, but was 
unnecessary for the proof. 


Let f: R > R be defined by f(x) = x. This function is surjective. To see 
this, let y © R. Now every real number has a unique cube root. So we let 
x= Vy. Then 


f(x) = 3 = (Vy)? = y. 
Hence y is in the image of f. Since y was chosen arbitrarily, we have 


proven that every element of the codomain is in the image of f. It follows 
that fis surjective. 


Consider the function f: Z — Z defined by 


n+2, ifn is even 


es ( +1, ifn is odd. 


Note that f maps an even integer to an even integer and the odd integers 
get mapped to every other odd integer. So we surmise that fis not surjective. 

To prove this formally, we need only find one element of Z that is not 
in the image of f. We claim that 5 is such an element. 


EXAMPLE 8 


EXAMPLE 9 


EXAMPLE 10 
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Suppose, on the contrary, that 5 = f(m) for some n € Z. If n is even, 
then f(m) =n + 2 = 5. Hence n = 3, a contradiction to the fact that n is 
even. On the other hand, if n is odd, then f(n) = 2n + 1 = 5, which gives 
n = 2, again a contradiction since n is odd. 

Thus 5 is not in the image of f and therefore f is not surjective. 


Let f: Z — Z be defined by 


i = if nis even 
He erg” Ae naacta: 


We will prove that fis surjective. We need to show that every element 
of Z is in the image of f. Let y © Z We want to find n € Z such that 
f(n) = y. Given the definition of f, it seems reasonable to expect that n will 
vary depending on whether y is even or odd. 

Suppose y is even. Since f maps even integers to odd integers and odd 
integers to even integers, the 1 we pick should be odd and from the definition 
of f, we can see that we should let n = y + 3. Then n is odd and f(n) = 
f(y + 3) = y + 3 —3 = y. This proves that y € Im f. 

Now suppose that y is odd. Let n = y — 1. Then, since n is even, we get 
f(x) = f(y - 1) =y —1+4+1 = y. Again we have y € Im f. Therefore, f 
is surjective. 


Let f: Z — Z be defined by 


hs 
f(n) = 2° 
n+4, — ifnisodd. 


ifn is even 


As in the previous example, f is surjective; however (perhaps surprisingly) 
we do not have to divide the proof into even and odd cases. The key is to 
note that, whereas f maps the odd integers to some of the even integers, f 
maps the even integers onto all the integers. Just try some examples to 
convince yourself of that fact. Once you see the pattern, you will see how 
the proof should go. 

Let y € Z. Then let n = 2y. Since n is even, we get f(n) = f(2y) = 2y/ 
2 = y, proving that y © Im f. We have proved that every element of Z is in 
the image of f and therefore fis surjective. 


Consider the function f: R > R defined by f(x) = x° + 4x + 1. We saw in 
Example 15 of Section 3.1 that Im f = R. It follows that f is surjective. 

It is important to note the difference in the method of proof for this 
example as compared with the previous ones. In Examples 5 and 6, given 
y € R, we found an x in R such that f(x) = y by taking the equation y = 
f(x) and solving for x in terms of y. Essentially the same method was used 
in Examples 8 and 9. But for this example, getting x in terms of y requires 
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solving a cubic equation, which is a difficult computation in general. So 
instead, we relied on the Intermediate Value Theorem to show that x exists 
without actually solving for x. 


Injective Functions 


Some functions have the property that two or more elements of the domain 
may be assigned to the same element of the codomain. In Example 2 of 
Section 3.1, we have g(1) = g(6) = g(12) = 0. 

We wish to single out functions for which this is not the case; that is, 
functions for which no two different elements of the domain are assigned 
the same value of the codomain. 


Definition 3.2.2 Let f: A — B be a function. Then fis injective (or an 
injection) if whenever a,, a, © A and a, ¥ a), we have f(a) ¥ f(a). 


. 3 . Es . 
A B A B 


EXAMPLE I! 


EXAMPLE 12 


f is not an injection: f takes fis an injection: f takes different 
different elements of A to the elements of A to different elements 
same element of B. of B. (It must do so for all pairs of 


distinct elements of A.) 


The accompanying diagram suggests the phrase ‘“‘one-to-one.” As ‘“‘onto” 
is used for surjective, this terminology is often used for injective. 


Let A = {1, 6, 12}, B = {0, 2, 12}. Define f: A — B by f(1) = 2, f(6) = 0, 
and f(12) = 12, as defined in Example 1 of Section 3.1. Since no two elements 
of the domain are mapped to the same element of the codomain, fis injective. 


Let g be as defined in Example 2 of Section 3.1. Then, since g(1) = g(6) = 
g(12) = 0, g is not injective. 


Using the definition is not always the easiest way to prove a function f 
is injective. We’ll try writing the definition another way. 


EXAMPLE 13 


EXAMPLE |4 


EXAMPLE [5 


EXAMPLE 16 


EXAMPLE |7 


3.2. SURJECTIVE AND INJECTIVE FUNCTIONS 101 


Let P(a,, a) be the open sentence “‘a; ¥ a,” and Q(a, a) be the open 
sentence “‘f(a,) # f(a). Then f is injective if 


for all a;, a2 € A, P(a, a) > O(a, a). 


For any values of the variables a, a), P(a:, a2) > Q(a,, a) is equivalent 
to its contrapositive —Q(a;, a) > AP(a), a2). nQ(a), ay) is the open sentence 
f(a) = f(a) and —P(a,, a) is the open sentence a; = a,. Hence we can say 
that f is injective if 


for all a,, a7 © A, whenever f(a.) = f(a), then a; = a. 


A proof that a function f is injective usually should be done as follows. 
Let a,, a, € A, where A is the domain of the function, and assume f(a,) = 
f(a). Starting with this assumption, then prove that a; = a). 


Let f: R > R be defined by f(x) = x°. Suppose that x,, x. € R and f(x) = 
f(x), that is, x} = x3. Since every real number has a unique cube root, we 
conclude that x; = Wx} = Wx3 = x), and so fis injective. 


Any nonconstant, linear function f: R — R is injective. Let f(x) = ax + b. 
Suppose that x,, x. © R and f(x,) = f(x). Then ax, + b = ax, + b, and so 
ax, = ax,. But since f is nonconstant, a # 0 and so x, = x,, completing 
the proof. 


Consider the function f: Z — Z defined by 


n+2, ifniseven 
2n+1, ifnisodd. 


ro { 


To show that fis injective, suppose that n,, nm. € Z and f(n,) = f(m). We 
consider three cases. 

If n; and n, are both even, then n, + 2 = m + 2,son, =m. 

If n, and n, are both odd, then 2m, + 1 = 2m + 1, and again n, = ny. 

Finally, if m, is even and n, is odd, then f(n,) is even and f(m) is odd. 
Thus f(,) + f(m), meaning that this case cannot even occur since we are 
assuming that f(,) = f(m). 

Therefore, in all cases for which f(n,) = f(m)), we get nm, = n2, implying 
that f is injective. 


Let A be any set and let i,: A — A be the identity function. Then i, is 
injective. (See Exercise 11.) 


Let f: R— R be defined by f(x) = x3 + 4x + 1, as in Example 10. To prove 
that fis injective using the method of the previous few examples is difficult 
because of the algebra involved. Instead we prove that if x, # x, then 


f%1) 7 fa). 
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Recall that in Example 15 of Sec. 3.1, we showed that fis an increasing 
function. From this fact it follows that if x, < x), then f(x,) < f(x.) and if 
xX, > x, then f(x,) > f(x). Hence f is injective. 


To prove that a function is not injective, we must resort to the negation 
of the definition. As we noted previously, a function f: A — B is injective 
if for all a,, a, € A, P(a,, a) > Q(a,, a2) where P(a,, a) is the open sentence 
“a, ~ a,” and Q(a,, a2) is the open sentence “‘f(a,) ¥ f(a,).”” To negate this, 
we need a statement with an existential quantifier: da,, a; © A such that 
P(a,, a) => Q(a,, a) is false. Since an implication, P > Q, is false only when 
P is true and Q is false, it follows that a function f is not injective if da,, 
a © A such that a, ¥ a, and f(a) = f(a). Note that this formulation fits 
our understanding of what it means for a function to be ‘‘not injective’’; 
namely, f maps two distinct elements of A to the same element of B. 


The function f: R > R defined by f(x) = x’ is not injective. We need only 
exhibit elements a), a; € R such that a, ~ a and f(a,) = f(a). This is easy 


since f(—1) = 1 = f(1). 
The function f: Z — Z defined by 


n+1, ifniseven 
2n+1, ifnis odd 


fn) = | 
is not injective, since, for example, f(1) = 3 = f(2). 


Bijective Functions 


As we’ve seen in the examples, a nonconstant linear function is both injective 
and surjective. Functions with this property are given a special name. 


Definition 3.2.3 A function that is both injective and surjective is called 
bijective or a bijection. 


Let A = {4, 6, 8, 10} and B = {—1, —3, 0, 4}. Then the function f: A > B 
defined by f(4) = 4, f(6) = —1, f(8) = 0, f(10) = —3 is a bijection. 


The function f: R > R defined by f(x) = x° is bijective since we have seen 
from Examples 6 and 13 that it is both injective and surjective. 


Let A be any set and let i,: A > A be the identity function. Then i, is bijective. 


Any nonconstant linear function is bijective by Examples 5 and 14. 


EXAMPLE 24 


EXAMPLE 25 


EXAMPLE 26 
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Let A = {0, 1, 2} and let f: A — A be defined by f(0) = 2, f(1) = 0, f(2) = 
1. It is clear that fis a bijection. One can think of f as a rearrangement of 
the elements of A. Such a rearrangement is called a permutation. 


More generally, we have the following definition. 


Definition 3.2.4 Let A be any set. A bijection f: A > A is called a 
permutation of A. 


Let A = {0, 1, 2}. In Example 24, we have one example of a permutation 
of A. It is easy to define others. In fact, there are 3! or six such permutations 
of A. (See the exercises.) 


The function f: R > R defined by f(x) = x° is a permutation of R. 


« MATHEMATICAL PERSPECTIVE: SETS OF PERMUTATIONS 


As we saw at the end of this section, a permutation is the mathematical way 
of viewing a rearrangement. For example, if you place your CDs in a holder 
in some order (say alphabetically by artist and chronologically for each 
separate artist), you have in fact chosen a permutation of your collection. 
But how many different orders could you choose? That is, how many permu- 
tations are there? 

In true mathematical fashion, let’s ask the question in the abstract: how 
many permutations are there of the set A = {1,2..., nm}? We could begin 
by trying some easy, special cases. When n = 2, there are clearly just two 
permutations (remember that the identity function is a permutation). When 
n = 3, there are six: 


1-1 1—>2 1-3 1-1 13> 2 1-3 
2—>2 2-1 2-2 2—3 2-3 2>1 
333 353 35> 1 3—>2 3-5 1 352 


With a little more work, you can see that there are 24 permutations of 
{1, 2, 3, 4}, and with a lot more work that there are 120 permutations of 
{1, 2, 3, 4, 5}. At this point, we can guess that, in general, there are n! = 1: 
2:-3...+n permutations of {1,2..., nm}. (Don’t forget: this is not a proof, 
just speculation based on a few examples. You'll be asked to rigorously 
prove this fact in the exercises for Section 5.2.) 

One of the authors had a relative who was an inveterate player of the 
Massachusetts daily lottery. The winning number each day had four digits, 
and so was in the range 0000 ... 9999, and the player could bet that her 
number would come up either in exact order or in any order. Thus, an 
“exact” bet on 1234 had only one chance in ten thousand of winning, but 
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an “any order” bet had 24 chances in ten thousand, because there are 24 
permutations of 1234 (of course, such a win paid less). However, the chances 
of an exact bet on 1111 winning (1 in 10000) were the same as those for an 
any order bet: rearranging the digits doesn’t change the number! For years, 
the relative mentioned bet $1 a day on her lucky number: 3331, any order. 
No amount of mathematical argument could convince her to improve her 
chances of winning by picking a number with no repeated digits. (What were 
her chances? In her defense, the odds are stacked against her either way.) 

Both of these examples involve counting, and are simple instances of the 
mathematical field of combinatorics. In fact, an entire field can be said to 
have grown just from the study of permutations. 

If you arrange your CD collection in a certain way, become dissatisfied 
with it because you can never remember where your favorite ones are, and 
so rearrange them, the net result is still a particular arrangement. That is, 
applying two permutations successively leads to another. For a more explicit 
example, consider doing this with the second and third permutations of 
{1, 2, 3} in the previous table: 


11> 252 
2-1-3 
333-1 


The net effect is the fifth permutation in the table. We are composing these 
two functions here; more about this in the next section. 

Further, each permutation can be undone by composing it with another 
particular one; if you decide that your new CD arrangement is even worse, 
you can always return to the original one. Or 


1-2-1 
2-3-2 
3>-1->3 


The “‘inverse”’ of the fifth permutation in the table is the sixth. 

That these two properties hold (along with a third, called associativity) 
means that the collection of permutations of a set A forms what mathemati- 
cians call a group, and their study is the field of group theory. Groups 
arise in many contexts of mathematics and physics; they are fundamental in 
quantum mechanics, for example. Their properties can also be used to solve 
some popular puzzles, such as Rubik’s Cube or the many incarnations of 
the 15 puzzle: 
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(Notice that moving a tile amounts to a permutation of the 15 numbered 
tiles and the blank space!) 


Exercises 3.2 


1. Determine which of the following functions are surjective. Give a formal 
proof of your answer. 
(a) f: ROR, f(x) = 2x +1 
(b) f: RR, f(x) = cos x 
(c) f: R > [-1, 1], f(x) = sin x 
(d) f: R* — R*, where R* is the set of nonzero real numbers, 


fx) =* 


(ce) fi ROR, f(x) = x° + 
(ff) f ROR, f(x) = 84+ 4 
(g) f: R — R’, R‘* the set of positive real numbers, f(x) = e* 


. _ faet+l, ifx=t 
th) FR Rf) = | Y, ifx<1 
_ eked Aiee0 
@ RoR s=[ Et 


(i) f: Z—> ZX Z, f(n) = (n,n) 
(k) f: ZX Z— Z, f(m,n) =m+n 


Xx 
(Ql) f: RX R*>R f(x, y) 5 
2. Determine which of the following functions are surjective. Give a formal 


proof of your answer. In each case, f: Z —> Z. 
(a) f(n) = 2n+ 1 


(b) f(n) | n, ifniseven 
n —- 
2n — 1, if nis odd 
(c) flr) | n, if nis even 
n —. 
n-1,  ifnisodd 
n, if nis even 
d —. aa 
(a) F) fe ifn is odd 
(e) fn) ane ifn is even 
e) f(n) = 
2n, if nis odd 


3. Let f: Z— Z be defined by 


n, ifnis even 


imy= a ee. sippienddl 
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10. 


11. 


12. 


Is the following a correct proof that f is surjective? Explain. 


Let y € Z If y is even, then f(y) = y. If y is odd, then f((y — 
3)/2) = y. In either case we get that y is in the image of f and 
therefore f is surjective. 


. Let A and B be sets. Let 7: A X B — A and 7: A X B — B be 


the projection functions defined in Example 4. Prove that 7, and m 
are surjective. 


. Let A and B be finite sets and let f: A — B be a function. 


(a) Prove that if f is surjective, then |A| = |B]. 
Hint: See Proposition 3.1.9 of Section 3.1. 
(b) State the contrapositive of part (a). 
(c) State the converse of part (a). Prove or disprove. 


. Let A and B be sets and let X be a subset of A. Let f: A > B bea 


surjective function. 

(a) Prove that B — f(X) Cf(A — X). 

(b) Give an example to show that equality does not in general hold in 
part (a). 


. Let f: A — B be a function. Prove that f is surjective if and only if 


f-'\(W) # ©@ for all nonempty subsets W of B. 


. Let f: A — B be a function. Which of the following statements are 


equivalent to the statement “‘f is surjective’? 
(a) For allb € B, Ja EC A 3D f(a) = 5. 

(b) Ifa EA, f(a) =DEB. 

(c) BCImf. 

(d) 3b € B 35 f(a) = b for some a € A. 


. Write out two injective functions with domain {0, 1, 2, 3, 4} and codomain 


{5, 6, 7, 8, 9, 10}. 


Explain why you cannot define an injective function with domain {0, 1, 
2, 3, 4} and codomain {5, 6, 7, 8}. 


Let A be any set and let i,: A — A be the identity function. Prove that 
i, 1S injective. 
Determine which of the following functions are injective. Give a formal 
proof of your answer. 
(a) f: ROR, f(x) = 2x + 1 
(b) f: RR, f(x) = cos x 

7 


(c) f: |-#.4| — R, f(x) = sin x 
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14. 


15. 


16. 
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(d) f: R* > R*, where R* is the set of nonzero real numbers, 
1 
fa) => 
(e) f: R > R*, R* the set of positive real numbers, f(x) = e* 


(f) fiZ>Z Xx Z, f(n) = (n,n) 
(g) f:Z X ZZ, f(m,n) =mt+n 


(h) RX R* SR, fle y) = 5 
Determine which of the following functions are injective. Give a formal 


proof of your answer. In each case, f: Z > Z. 
(a) f(n) = 2n +1 


(b) fn) n, ifniseven 
n bs 
2n-1, ifn is odd 
() fln) | n, ifn is even 
c) f(n) = 
n—-1, ifn is odd 
n, ifn is even 
d — — 
(a) A) as if n is odd 
(e) fn) teed if n is even 
e) f(n) = 
2n, ifn is odd 


Determine which of the following functions are injective. Give a formal 
proof of your answer. In each case, f: R > R. 

(a) f(x) = xt 4+ xX? 

(b) f(x) = 4+ x? 


(CV fh) =e = x 
re a 1, ifx=1 
x’, ifx<1 
_f-x-1, ifx=0 
(e) F@) = 2. - dee 


Let A and B be sets. Let 7: A X B— A and m: A X B > B be the 
projection functions defined in Example 4. Prove that 7, is not injective 
unless B consists of only one element and that 7, is not injective unless 
A has only one element. 


Let A and B be sets and X and Y be subsets of A. Let f: A — B be 
an injective function. Prove that if f(X) C f(Y), then X C Y. (See 
Proposition 3.1.5.) 


Let A and B be sets and X and Y subsets of A. Let f: A — B be an 
injective function. Prove that f(X N Y) = f(X) N f(Y). (See Exercise 
15 of Section 3.1.) 
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18. 


19. 


20. 


21. 


22. 


23. 
24. 


25. 


26. 


27, 


28. 


29. 


Let A and B be sets and X and Y be subsets of A. Let f: A > B be an 
injective function. Prove that f(X) — f(Y) = f(X — Y). (See Exercise 
16 of Section 3.1.) 


Let A and B be finite sets and let f: A — B be a function. 

(a) Prove that if f is injective, then |Im f| = |A]. 
Hint: See Proposition 3.1.9 of Section 3.1. 

(b) Is the converse of part (a) true? Prove or disprove. 

(c) Prove that if f is injective, then |A| = |B]. 

(d) State the contrapositive of part (c). Give a proof of the contrapositive 
that doesn’t use part (c) by invoking the Pigeonhole Principle. 

(e) State the converse of part (c). Prove or disprove. 


Let f: A — B be a function. Which of the following statements are 
equivalent to the statement ‘‘f is injective’? 

(a) f(a) = f(b) when a = b. 

(b) f(a) = f(b) and a = b for all a, bin A. 

(c) If a and b are in A and f(a) = f(b), then a = b. 

(d) If a and b are in A and a = BD, then f(a) = f(b). 

(e) If a and b are in A and f(a) # f(b), then a # b. 

(f) Ifa and b are inA and a # 5, then f(a) # f(b). 


Let f: A > B be an injective function. Let X be a subset of A. Prove 
that f-'(f(X)) = X. (See Exercise 21 of Section 3.1.) 


Let f: A — B bea surjective function. Let W be a subset of B. Prove 
that f(f-'\(W)) = W. (See Exercise 22 of Section 3.1.) 


Write out all permutations of the set A = {0, 1, 2}. 


(a) Write out three permutations of R. 
(b) Write out three permutations of Z. 


Using only the definitions ofinjective and surjective, complete the follow- 
ing statements: 

(a) A function f: A — B is not injective if and only if .... 

(b) A function f: A — B is not surjective if and only if.... 

(c) A function f: A — B is not bijective if and only if... . 


Let f: A — B be a function where A and B are finite sets such that 
|A| = |B|. Prove that fis injective <= f is surjective. 

Let A and B be finite sets and let f: A — B be a function. 

(a) Prove that if f is bijective, then |A| = |B]. 

(b) State the contrapositive of part (a). 

(c) State the converse of part (a). Prove or disprove. 


Let A and B be sets and let X be a subset of A. Let f: A > B bea 
bijection. Prove that f(A — X) = B — f(X). (See Exercise 6.) 
Let A and B be sets and let f: A — B be a function. Prove that f is 


bijective if and only if for every b € B, f~'({b}) is a single element subset 
of A. 
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30. Let A and B be sets and let f: A — B be a surjective function. For each 
b © B, let A, = f'({b}). Prove that the collection of sets {A, | b € B} 
is a partition of A. 


Discussion and Discovery Exercises 


D1. Let P and Q be statements. Suppose that one wanted to prove the 
statement: P and Q are not both true. Explain why a valid proof of this 
statement would be to prove P > —Q. 


D2. Using a first-year calculus text, look up the proof that says that if a real 
valued function f is differentiable at a number a, then f is continuous 
at a. Write out a careful, detailed explanation of this proof, listing all 
definitions and prior theorems that are used in the proof. 


D3. Use the previous two problems to prove that a function f cannot be 
both differentiable at 0 and discontinuous at 0. Include in your proof 
the relevant statements P and Q and explain how you use the logical 
principle of Exercise D1. 


D4. Using your knowledge of first-year calculus, explain why there cannot 
be an injective function f: R — R such that f(x) = In x for all x > 0. 
(In is the natural logarithm function.) 


DS. A real-valued function f defined on a closed interval [a, b] is said to 
be decreasing on [a, 5] if for all x,, x. © [a, b], if x; < x), then f(x;) > 
Ff (x2). 

(a) Negate this definition by completing the following sentence: fis not 
decreasing on [a, b] if.... 

(b) Prove that every decreasing function on a closed interval [a, b] 
is injective. 

D6. In light of Example 17 of this section, state and prove a theorem 
that relates how the derivative of a function f: R — R can be used 
to determine if f is injective on an interval [a, b]. Be as general as 
possible. 


D7. Let f: R — R be defined by f(x) = 2x + 1. Criticize the following 
“proof” that f is surjective. 


Let ye Ry = 2+ 1x = 


Therefore, 


fo = (24) =2t ay, 


Hence y C Imf. 
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COMPOSITION AND INVERTIBLE FUNCTIONS 


The concept of function is supremely important and is fundamental to virtu- 
ally all of mathematics. We often step back from viewing functions as “‘rules”’ 
relating objects from different sets and study functions as objects themselves. 
In this way, we enrich other concepts; we can, for example, consider sets of 
functions, and devise ways to combine functions and create others. 


Definition 3.3.1 Let A and B be nonempty sets. We define F(A, B) = 
{f | f: A — B}, the set of all functions from A to B. If A = B, we simply 
write F(A). 


As we go along, we will study these sets in many different contexts, and 
find them interesting sources of unusual examples. For now, notice that we 
can abbreviate the statement that f: A — B is a function to f € F(A, B). 


Composition of Functions 


One of the most important ways of creating new functions in calculus is by 
taking the composition of two functions. We can define composition of 
functions in general as long as the domains and codomains match up appro- 
priately. 


Definition 3.3.2 Let A, B, and C be nonempty sets, and let f € F(A, 
B), g € F(B, C). We define a function gf € F(A, C), called the composition 
of f and g, by gf(a) = g(f(a)), Va € A. 


gf 


EXAMPLE | 


EXAMPLE 2 


EXAMPLE 3 
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Note: We will use the simple juxtaposition gf to denote composition of functions 
instead of the more standard go f. 


Notice that if f, g € F(A), then both gf and fg are defined. 


Let A = {0, 1}, B = {red, green, blue}, and C = {7, e}. Define a function 
f: A — B by f(0) = red and f(1) = blue; also define a function g: B > C 
by g(red) = 7, g(green) = e, and g(blue) = e. 

Then gf(0) = 7 and gf(1) =e. 

Notice that the function fg (composing in the reverse order) is not defined; 
this is because the codomain of g is not the domain of f. In simpler terms, 
f, for example, does not assign a value to 7 = g(red). 


Let f, g © F(R) be defined by f(x) = x? and g(x) = x + 1. Then gf(x) = 
g(x?) =x? + 1. 

Here we can consider the function fg as well, and we see that fg(x) = 
f(x + 1) = (x + 1)*. Thus fg and gf do exist, but fg # gf! 

In this example, you should clearly understand why we concluded that 
fg # gf. Recall that two functions in F(R) are equal if and only if they take 
the same values at all x € R; the statement has a universal quantifier. Thus 
they are not equal if we can find at least one x for which they do not agree. 
But gf(1) = 12 + 1 = 2 and fg(1) = (1 + 1) = 4. 


Let f, g € F(Z) be defined by 


es 2n, if nis even 
f(n) ae if nis even tate 
n)= an => 
ate uiP pig odd ch moe ienisedd. 


If n is even, gf(n) = g(n + 2) = 2(n + 2) since n + 2 is even. If n is 
odd, gf(n) = g(2n + 1) = (Qn +: 14+ 1)/2 =n + 1. Thus we have 


2n + 4, ifn even 
n+1, if is odd. 


gf(n) = | 


Now we compute the composition fg. If n is even, fg(n) = f(2n) = 
2n + 2. If nis odd, fg(n) = f((m + 1)/2). The value of f((n + 1)/2) depends 
on whether (n + 1)/2 is even or odd. 

If (n + 1)/2 is even, then (n + 1)/2 = 2t for some integer ft or equivalently 
n = 4t — 1. In this case, 
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If (n + 1)/2 is odd, then (n + 1)/2 = 2t + 1 for some integer ¢t or 
equivalently n = 4t + 1. We get 


Thus 
‘2n + 2, if nis even 
fe(n) = a ifn = 41 — 1 forsomer € Z 


.n+ 2, ifn = 4t + 1 forsomet € Z. 


In computing composition of functions or computing any function for 
that matter, take care to differentiate between the symbol for the function 
and the value of the function at an element of its domain. In the last example, 
we computed the composition gf by finding gf(n) for any n in Z. In doing 
the computation of gf it is incorrect to write gf = 2n + 4 when n is even 
since gf and 2n + 4 do not belong to the same set. gf is a function and 
2n + 4 is an integer. The expression gf(n), however, is the value of the 
function gf at n and therefore is an integer. So the correct expression is 
ef(n) = 2n + 4. 


Let A bea set. Then the identity function i, has a special property with 
respect to composition. 


Let f: A — B be a function. Then fi, = f and igf = f. 


PROOF: The proof is left as an exercise. @* 


A question one might ask is ‘“Which properties of f and g carry over to 
the composition gf?” 


Let f € F(A, B) and g € F(B, C). 


1. If f and g are surjections, then gf is also a surjection. 
2. If f and g are injections, then gfis also an injection. 
3. If fand g are bijections, then gf is also a bijection. 


PROOF: 1. Suppose that fand g are surjective functions. Since gf is a function 
from A to C, to show that gfis surjective, it is necessary to take an arbitrary 
element c in C and show that it is in the image of gf; that is, given c in C, 
we must show that there is an element a in A such that gf(a) = c. 

So let c € C. Then since g is surjective, there exists b © B such 
that g(b) = c. By the same token, since f is surjective and b € B, there 
exists a € A such that f(a) = b. But then gf(a) = g(b) = c, proving that 
c € Im gf. Therefore, gf is surjective. 


Corollary 3.3.5 


Proposition 3.3.6 
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2. Suppose that f and g are injective functions. To prove that gf is in- 
jective, we must take arbitrary elements a,, a, © A, and assuming that 
gf(a,) = gf(a), we must then show that a, = a). 

So let a;, a7 € A and suppose that gf(a,) = gf(a). Now gf(a:) = gf(a:) 
means that g(f(a,)) = g(f(a)) and since g is injective it follows that 
f(a) = f(a). But then we can conclude that a, = a, since fis also injective. 
Hence gf is injective. 

3. This follows immediately from 1 and 2. «% 


Let A be a nonempty set. Let f and g be two permutations of A. Then gf is 
a permutation of A. 


PROOF: The proof is left as an exercise. «.* 


You may wonder whether the converses of the statements in Proposition 
3.3.4 are also true. It seems reasonable, for example, that if gf is bijective, 
then fand g must be as well; however, it’s false, as you will see in the exercises. 

Composition of functions shares a property with the operations of addi- 
tion and multiplication of real numbers: associativity. The next proposition 
gives a proof of this result. 


Let A, B, C, and D be nonempty sets. Let f € F(A, B), g € F(B, C),h © 
F(C, D). Then (hg) f = h(gf). 


PROOF: Recall that two functions are equal if they have the same domain 
and codomain and if they assign each element of the domain to the same 
element of the codomain. In this case, both (hg) f and h(gf) have domain A 
and codomain D. 

Now let a € A. Then 


[(hg) f](a) = (hg)( f(a) = h(g(f(@)) 


and 


[h(gf)](a) = h(gf(a)) = h(g(f(a)))- 


Thus [(hg)f](a) = [A(gf)](a@) for all a € A. It now follows that (hg) f = 
gf). ® 


Because of the previous proposition, we can delete the parentheses and 
simply write hgf for the composition of three functions without ambiguity. 

The above proposition can be extended to four or more functions so that 
if we have n functions f,, f;, ... f, for any positive integer m we can write 
fifi. ..- fy for their composition without need of parentheses, provided of 
course that these compositions are all defined. 
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Inverse Functions 


You are probably familiar with the notion of an inverse function from calcu- 
lus. What the inverse of a function f does, if it can be defined, is map an 
element f(a) of the image of f back to a. For example, if f(x) = x’, then the 
inverse of f, if it exists, should map 4 = f(2) to 2. In general, it should send 
f(@ = @ back to a and if a = 0, the function that will do this is the square 
root function. (See Example 7 for a more detailed discussion of this example.) 

Therefore, the inverse of a function f, when composed with f, should be 
the same as the identity function, which maps every element to itself. 

So let’s try to define the inverse of a function f: A — B where A and B 
are any sets. Since f sends an element of A to an element of B, the inverse 
function must map elements of B to A. More specifically, if f maps a in A 
to b in B, then the inverse of f, which we will denote f', must map b to a. 


| 
<q 
fl 


Thus the composition of f and its inverse must be the identity mapping 
of the appropriate set. 


ffaig 


This leads to the following definition. 


EXAMPLE 4 


EXAMPLE 5 


EXAMPLE 6 


EXAMPLE 7 


3.3. COMPOSITION AND INVERTIBLE FUNCTIONS 115 


Definition 3.3.7 Let A and B be sets, and let f € F(A, B). Then f is 
invertible if there is a function f-! © F(B, A) such that ff = i; and 
f“f = is. If f7! exists, it is called the inverse of f. 


Note that the symmetry of this definition implies that if f is invertible, 
then its inverse function f~ is also invertible and that the inverse of f~ is f. 

Note also that we have to justify calling f~' the inverse of f. To see that 
there is only one function that can be the inverse of a function f, suppose 
there exists another function g: B — A such that gf = i, and fg = ig. Then, 
using Propositions 3.3.3 and 3.3.6 as well as the definition of invertible 
function, we get g = gis = g(ff") = (gf)f! = inf = fr. 

Thus f7! is unique. 


Let f € F(R) be defined by f(x) = x°. Then f is invertible and f(x) = 
Wx. 


Define f € F({1, 2}, {1}) by f(1) = f(2) = 1. (It’s the only such function, 
really.) Then fis not invertible because there are only two functions in F({1}, 
{1, 2}), and neither one satisfies the definition of inverse. 


Let A be any set and let i4:A — A be the identity function. Then i, is 
invertible and i4! = i,. 


Let f € F(R) be defined by f(x) = x’. As previously noted, the inverse of 
this function, if it exists, must be the function f-!(x) = Vx. But we run into 
a couple of problems with this example. 

First of all, because R is the domain and codomain of f, it should also 
be the domain and codomain of f™!. But Vx is not defined for all real 
numbers, only for nonnegative ones. This problem can be fixed up easily by 
just redefining the codomain of f to coincide with its image, namely, the set 
[0, 2°) of nonnegative real numbers. In effect, we have made f a surjective 
function. 

The second problem is that since f(2) = f(—2) = 4, f-! must map 4 to 
both 2 and —2. But no function can send an element of its domain to more 
than one element of its codomain by the very definition of function. The 
problem is that f is not injective. So our function f, even with the new 
codomain, is not invertible because it is not injective. 

We can, however, make f injective by restricting its domain to the set 
[0, 2). We now have a new function f € F([0, ~)) defined by f(x) = x’, 
which is injective and surjective and its inverse function f7! is in F({0, ~)) 
and is defined by f(x) = Vx. 
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In light of the last example, we should expect that an invertible function 
must be injective and surjective. The converse is also true. We prove this in 
the following theorem. 


Let A and B be sets, and let f € F(A, B). Then f is invertible if and only if 
f is bijective. 

PROOF: Suppose that fis invertible, so that f~' exists. We need to show that 
f is both injective and surjective. 

To show that fis injective, suppose that a,, a, € A and f(a,) = f(a). 
Applying f~' to both sides of this equation gives f'f(a,) = f~'f(a). Since 
ff =i4, we get i,(a,) = i,(a,) and this implies that a; = a). Hence fis injective. 

To see that f is also surjective, we need to show that B, the codomain 
of f, equals the image of f. Since the image is by definition a subset of the 
codomain, we only have to show that every element of B is in the image of f. 

Let b € B. If we let a = f-'(b), we get f(a) = f(f(b)) = ff) = 
izg(b) = b. This proves that b is in the image of f and therefore f is surjective. 

Conversely, suppose that f is bijective. To prove that fis invertible, we 
must define a function f™! such that ff! = is and f!f = i,. We start by 
letting b € B. Since f is surjective, we can find a € A such that f(a) = b; 
since f is injective, only one such a exists. Define f~'(b) = a. 

Then ff-'(b) = f(a) = b, so ff! = ig. 

Now let a € A and let b = f(a). Then by the definition of f"', 
f-\(b) = a. Consequently, f-'f(a) = f(b) = a. Therefore, f-'f = iy. 

Thus f is invertible. 


We saw in Example 23 of Section 3.2 that any nonconstant linear function 
is bijective. It follows then from Theorem 3.3.8 that any nonconstant linear 
function is invertible. But how do we find its inverse? Let f(x) = ax + b, 
a # 0. To find f~!, we merely let y = ax + b and solve for x in terms of y. 
We get f-'(y) = (y — b)/a. The reader should verify that this is indeed the 
inverse of f. 


The function f: [— 7/2, 7/2] — [-1, 1] defined by f(x) = sin x is bijective 
and therefore invertible. Its inverse is the arcsine function. Note that we 
restricted the domain of f to the interval [— 7/2, 7/2] to make it injective 
and used the image of the sine function as the codomain to make it surjective. 


Sometimes, when a function f is invertible, it may be very difficult or 
impossible to write out a formula for f~!. Nevertheless, thanks to Theorem 
3.3.8, it is possible to determine that f exists even if we cannot find a specific 
formula for it. 


Let f: R — R be defined by f(x) = x? + 4x + 1. In Section 3.2, we showed 
that fis bijective. (See Examples 10 and 17 of Section 3.2.) Then by Theorem 
3.3.8, f is invertible. 
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€2 MATHEMATICAL PERSPECTIVE: MULTI-VALUED FUNCTIONS 


You might remember from your calculus course that many of the functions 
studied there do not have inverses, because they fail the ‘“‘horizontal line 
test.” For example, the sine function has no inverse because, in particular, 
the horizontal line y = 1 hits the graph at every peak. 


This means that if we try to construct the inverse, we cannot decide 
how to define sin7'(1). One choice is to restrict the domain of the sine 
to [—7/2, 7/2], as we did in Example 9. However, we can try to draw the 
graph of the inverse by simply reflecting the above graph about the line 


y=x. 


The resulting graph does not represent a function because, of course, it 
fails the vertical line test: there are now many y-values corresponding to 
x = 1. Such graphs are said to represent multi-valued functions. They are 
studied not by restricting the domain, but instead by extending it. That is, 
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we would “‘paste’’ together copies of the interval [—1, 1] so that each y- 
value corresponds to a single x-value in only one of the copies. 

This ingenious (and hard to visualize) idea was formulated by Bernhard 
Riemann (1826-1866), and grew from his study of functions of a complex 
variable (see Section 7.3). The extended domain is called the Riemann Surface 
of the multi-valued function. For example, when the natural logarithm func- 
tion is extended to the complex numbers in the most appropriate way, it 
leads to a multi-valued function whose Riemann surface can be visualized 
as a Spiral: 


Remember that this picture represents the domain of the function; a 
graph of the function itself would require four dimensions. 


Exercises 3.3 


1. Let f(x) = x? + 1, g(x) = sin x, and h(x) = e” in F(R). 
(a) Compute fg and gf. 
(b) Compute fh and hf. 
(c) Compute gh and hg. 


2. For the following functions f, g € F(Z), compute fg and gf. 


(a) f(n) =2 d g(n) n, if nis even 
a — a = 
oi iat 2n— 1, if nis odd. 


if niseven 1, if niseven 


~ 
and n)= 
a(n) { bn. “itnisedd. 


n 
(b) f(n)= 2” 
n+1, ifn isodd 


53 
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Express each of the following functions as a composition f = gh. Be sure 
to give appropriate sets A, B, and C so that h € F(A, B) and g € F(B, 
C). (Neither g nor / should be an identity function, but there may be 
many possible answers.) 

(a) f © F(R) defined by f(x) = (3x’ — 11)8 


(b) f € F(R) defined by f(x) = cos(# : 


(c) f € F(Z, R) defined by f(x) = cos(# ; ‘) 


(d) f € F({0, 1}, R) defined by f(0) = 1 and f(1) = 0 
(e) f € F(R, Z*) defined by f(x) = |[x]| 


4. Prove Proposition 3.3.3. 


5. Prove Corollary 3.3.5. 


10. 


. State the converse of each part of Proposition 3.3.4. Prove or disprove 


each one. 


. Let A, B, and C be nonempty sets, and let f € F(A, B) and g € F(B, C). 


(a) Prove that if gf is injective, then f is injective. 
(b) Prove that if gf is surjective, then g is surjective. 


. Let A and B be nonempty sets. Let f € F(A, B) and g € F(B, A). Prove 


that if gf and fg are bijective, then so are f and g. 
Hint: Use the previous exercise. 


. Let f: A > B and suppose that S C A, S # ©. We can define a function 


fis: S > B, called f restricted to S, by simply taking fs(a) = f(a), Va € S. 

(a) If fis injective, prove that fis is injective. 

(b) Prove or disprove the converse of part (a). 

(c) Give an example to show that if f is surjective, then fis need not 
be surjective. 

(d) Give an example in which f is surjective, S is a proper subset of A, 
and fis is also surjective. 

(e) If f € F(A) and S C A, what can you say about ffis and fisf? 


Determine which of the following functions are invertible. If a function 
fis invertible, find f-'. 
(a) f: {-1, 0, 1} > {a, e, V2} defined by f(-1) = e, f(0) = e, and 


f(l) = V2 
(b) f: {-1, 0, 1} > {z, e, V2} defined by f(-1) = e, f(0) = 7, and 
fl) = V2 


(c) f: Z— Z defined by f(x) = 7x + 3 

(d) f: R = R defined by f(x) = 7x + 3 

(e) f: Z— Z defined by f(n) = 2n 

(f) f: Z— E, E the set of even integers, defined by f(n) = 2n 
(g) f: R = R defined by f(x) = cos x 
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11 


12. 


13. 


14. 
5: 


16. 


17. 


18. 


. Define f: RR — 14} > R - {1} by f(x)= 


(h) f: [0, 7] — [-1, 1] defined by f(x) = cos x 
(i) f: RR defined by f(x) = e* 
(j) f: R> {x ER | x > 0} defined by f(x) = e 


(3.0), ifnEE 


(74.1), ifn€O 


4n, ifn>O and nEE 
A|n| + 1, ifn=0 and nEE 
4n + 2, ifn>0O and nEO 
4ln|+3, ifn<0O and nEO 


(k) f: Z— Z X {0, 1} defined by f(n) = 


(1) f: Z— Z* defined by f(n) = 


1 
1 


(a) Prove that f is injective. 
(b) Prove that f is surjective. 
(c) Find f7'(x). 


Let a, b, c, d © R such that ad — bc # 0 andc # 0. Define f: R — 
_ axtb 
{—d/c} > R — {a/c} by f(x) = teal 


(a) Prove that f is injective. 
(b) Prove that f is surjective. 
(c) Find f-1(x). 


Let A = {a, b, c}, a set with three elements. Let S(A) be the set of all 
permutations of A. 

(a) Write out the six elements of S(A). 

(b) Compute the inverses of each element of S(A). 


Note: Each inverse is also an element of S(A). 

Let f € F(A, B). Prove that if fis bijective, so is f~'. 

Let A be a set and let f € F(A). Prove that fis invertible and f = f7 if 
and only if ff = iy. 

Let f € F(A, B) and g € F(B, C). Prove that if f and g are invertible, 
so is gf and that (gf)! = fig. 


Let A be a nonempty set. Let f, g, h © F(A). Suppose that fg and gh 
are bijective. Prove that f, g, and h are all bijective. 


Let A, B, and C be nonempty sets, and let f € F(A, B) and g,h € F(B, C). 
(a) Prove that if fis surjective and gf = hf, then g = h. 
(b) Give an example in which gf = hf, but g # h. 


19. 


20. 
21. 
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Let A, B, and C be nonempty sets, and let f, g € F(A, B) andhe F(B,C). 
(a) Prove that if h is injective and hf = hg, then f = g. 
(b) Give an example in which hf = hg, but f # g. 


Let f(x) = x° + x. Prove that fis invertible without actually solving for f7!. 
Let A and B be sets and let f: A — B be a function. Suppose that f has 


a left inverse g; that is, there exists a function g: B — A such that gf = 
i,. Prove that f is injective. 


Let A and B be sets and let f: A — B be a function. Suppose that f has 
a right inverse h, that is, there exists a function h: B — A such that 
fh = ig. Prove that f is surjective. 


Discussion and Discovery Exercises 


D1. 


D2. 


D3. 


One of the most important theorems concerning real-valued functions 
is the Fundamental Theorem of Calculus. It is often stated in two parts. 
Part 1 says that if f(x) is continuous on the closed interval [a, b] and a 
function g is defined by g(x) = é f(t)dt, then g is differentiable on 


[a, b] and g’(x) = f(x) for all x € [a, b]; that is, g is an antiderivative 

for f. 

(a) Is the following statement true? If fis continuous on [a, b] and g 
is an antiderivative for f, then g(x) = [ , f(t)dt. 


(b) State part 2 of the Fundamental Theorem of Calculus and give 
a proof. 


Give an example of a bijective function f: R — R that is neither increas- 
ing nor decreasing. 


Let f: R — R be defined by f(x) = x? + 4x + 1. In Example 10 we 
showed that f is invertible without finding a specific formula for f7. 
The function f is differentiable everywhere. What about the function 
f-'? At what numbers is it differentiable? 


4.l 
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BINARY OPERATIONS 


We all learn early in school (if not before) about the usual arithmetic opera- 
tions: addition and multiplication, as well as subtraction and division. We 
will be studying many other operations that have some of the same properties 
as these, and so we need to introduce a general term for them. Now addition 
is an operation that takes two integers and produces a third; 2 and 3 produce 
5, for example. We can think of addition then as a function from Z Xx Z to 
Z. This function would map (2, 3) to 5, (7, —6) to 1, and so on. With this 
example in mind, we define the notion of a binary operation on a set A. 


Definition 4.1.1 A binary operation on a nonempty set A is a function 
from A X AtoA. 


A binary operation on a set A, then, can be thought of as a way of 
combining two elements a and b of A to produce a third element. Strictly 
speaking, we should use functional notation, say f(a, b), to denote that 
element, but that’s a bit cumbersome. It’s much more common to introduce 
an operation symbol analogous to + or X, such as *, to denote the binary 
operation and write a * b instead of using the functional notation * (a, b). 


The usual operations of addition, multiplication, and subtraction are, of 
course, binary operations on Z (and for that matter, on Q and R). 
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It is important to note that when combining two elements of a set by 
means of a binary operation, the third element must be in the set. So, for 
example, addition is a binary operation on Q because the sum of two fractions 
is a fraction. 


Division is not a binary operation on Z, since, for example, 1/2 is not an 
element of Z. Worse, 1/0 is undefined, so division is not a binary operation 
on Q or R either. It’s interesting (and instructive!), though, to notice that 
division is a binary operation on Q — {0} and on R — {0}. 


These examples are the most important binary operations on Z, Q, and 
R. We can of course define many others. Here are two examples. 


We will define a binary operation on Z as follows. If a, b € Z, leta * b = 
2a + b. Here the + sign is ordinary addition of integers. So for example, 
5 * 8 = 18 and (—2) * 6 = 2. 


Let * be the binary operation on Z defined by a * b = a+ b+ 1. For 
example, 6 * 10 = 17. 


These last two examples are contrived, of course, and thus not ‘‘impor- 
tant” examples of binary operations. But examples such as these serve to 
illustrate properties that a binary operation may or may not have. For in- 
stance, notice that, using the binary operation of Example 3, 8 * 5 = 21. 
Thus 5 * 8 ¥ 8 * 5. Hence a commutative law that we take for granted for 
addition and multiplication of integers does not hold for all binary operations. 
We will discuss such questions in this section. 


Let A be a nonempty set. Recall that the set F(A) is the set of all functions 
from A to A. If f and g are two elements of F(A), then their composition 
fg is also an element of F(A). Thus composition is a binary operation on 
the set F(A). This example is no doubt less familiar to you than the others, 
and has many ‘‘unusual’” properties, as you will see. 


Let A be a set. The set P(A) is the set of all subsets of A. We can define a 
binary operation on P(A) as follows. If X and Y are in P(A), let X * Y = 
X (1 Y, the intersection of the sets X and Y. Since X ™ Y is a subset of A 
and therefore in P(A), we have defined a binary operation on P(A). (Note 
that we could also define a binary operation on P(A) by taking the union 
of X and Y. This example will be considered in the exercises.) 
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Let f, g © F(R), the set of all functions from R to R. Let f* g = f+ g, the 
usual addition of functions: (f + g)(x) = f(x) + g(x) for all x in R. Since 
f + gis a function from R to R, we get a binary operation on F(R). 

Another binary operation on F(R) is ordinary multiplication of functions: 
(f * g)(x) = (f° 8)(x) = F(x)g(*). 

We will call these two operations on F(R) simply addition and multiplica- 
tion on F(R). We will use the usual + sign to denote the sum of two functions 
and - for multiplication; that is, the product of the functions f and g will be 
written f - g so as not to be confused with composition. 


Let M,(R) denote the set of all symbols of the form 
a b 
(' ) : 
where a, b, c, dE R. Such a symbol is called a 2 X 2 matrix. Matrices play 


an important role in the study of systems of linear equations. We define a 
binary operation called addition on M,(R) as follows: 


ie _) ‘ 4 ie v4 
+ = 
c d gh c+g dth 
Another binary operation, called multiplication, can also be defined on 
M,(R). Its definition is a bit more complicated than that of addition. It is 
not just multiplication of the corresponding terms. We will not go into 


the justification of this definition here except to note that it has important 
applications in linear algebra. We define 


( ’) e ’) (ee af + i 
c ae h ce+dg cft+dh/— 
Associative and Commutative Laws 


Using as our guide the operations that you’re very familiar with, we now 
begin to classify binary operations by some of their properties. 


Definition 4.1.2 A binary operation * on A is associative if (a * b) * 
c=a*(b*c),VabcEA. 
A binary operation * on A is commutative if a * b = b * a, Va, b € A. 


Note that the definitions of associative and commutative binary opera- 
tions contain universal quantifiers. Proving a binary operation associative or 
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commutative requires proving the appropriate property for all elements of 
A. On the other hand, to prove that a binary operation is not associative or 
not commutative requires only that the definition break down for some 
elements of A. Thus a single counterexample will suffice. 


Addition and multiplication are associative and commutative on Z. These 
properties are actually axioms of the integers and will be discussed in Section 
5.1. Addition and multiplication are also associative and commutative on Q 
and R. Subtraction, however, is neither associative nor commutative. For 
example, (3 — 2) -1=1-1=0,but3 —- (2-1) =3—-1=2; and 8 — 
4#4-8. 

Similarly, division fails to be associative or commutative on Q — {0} or 
R — {0}. (See Exercise 1.) 


Consider the binary operation on Z defined in Example 3: a * b = 2a + b 
for all a, b © Z. We saw that this operation is not commutative. It is not 
associative either since, for example, (2 * 3) * 4 = 7 * 4 = 18 and 2 * 
(3 * 4) =2* 10 =14. 


The binary operation on Z defined in Example 4,a * b =a +b + 1, is both 
commutative and associative. The proofs go as follows. 

Leta,b€ Z. Thena*b=a+b+1=b+a+12=bD+#a. Thus * 
is commutative. Note that we used the fact that ordinary addition in Z 
is commutative. 

Let a, b, c © Z. Then 


(a*b)*c=(at+b+1)*#c=(atb+1)+ce+1l=atbt+cr+2 
and 

ax(b¥c)=a¥*(b+ct+1)=at(6t+cec+1)+1l=atbt+crt+2. 

Hence (a * b) *c = a * (b * c) for all a, b, c € Z, proving that * is 


associative. In this proof, we used both the associativity and commutativity 
of ordinary addition in Z. 


Composition is associative on F(A), for any set A. This fact was proved in 
Proposition 3.3.6. Interestingly, composition is not usually commutative on 
F(A). For example, if A = R, consider the functions f(x) = x + 1 and 
g(x) = x: 

f(g(x)) =x? +1 but g(f(x)) = @ + 1. 


(In Exercise 3, you will see exactly what we mean here by “‘usually.’’) 
Let A be a set. The binary operation on P(A) defined in Example 6 is 


both associative and commutative. This fact follows from parts 2 and 4 of 
Proposition 2.2.2. 
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Addition on F(R) is both associative and commutative. To see that addi- 
tion is associative, let f g, h © F(R). We must show that (f + g) + 
h =f + (g + A). It is clear that the domain and codomain of both these 
functions is R. We must show that they take on the same values. Let 
x E€ R. Then 


[f+ g) + A](x) = (fF + g)(x) + A(x) (def. of addition on F(R)) 
= [f(x) + g(x)] + A(x) (def. of addition on F(R)) 
= f(x) + [g(x) + A(x)] (assoc. of addition on R) 
= f(x) + (g+h)(x) (def. of addition on F(R)) 
=[ft+(g+A)](x) (def. of addition on F(R)). 


This proves that (f+ g) t+h=ft+(gt+hy). 

Note that we use the fact that addition on R is associative. It is also 
important for proofs involving functions to note that the symbol f and the 
expression f(x) mean two different things. The symbol frefers to the function, 
an element of F(R), and f(x) is the value of the function at the real number 
x and hence is a real number itself. 

We leave the proof that addition is commutative as an exercise. (See 
Exercise 4(a).) 


Multiplication on F(R) is both commutative and associative. The proofs are 
left as exercises. (See Exercises 4(b) and 4(c).) 


Addition on M,(R) is associative and commutative. Here is a proof that 
addition is commutative: let a, b,c, def ghER. 


(" “V+(° ‘el as 
c d) \g nh) \ct+g dth 
ee eo) 
\gte Atd 
a 
= + ; 
gh cd 


We leave the proof that addition is associative as an exercise. (See Exer- 
cise 5(a).) 


Multiplication on M,(R) is associative but not commutative. We prove that 
multiplication is associative and leave the proof that it is not commutative 
as an exercise. (See Exercise 5(b).) 

LetabcodegfighxyzweEerR. 
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((" ) (‘ NC ae ae+bg ee le ”) 
c d/\g cet+dg cf+dh 


Ww 


ll 


(ae + bg)x + (af + bh)z pales en onenae 
(cet+dg)x+(cf+dh)z (ce+dg)y+(cft+dh)w 


Il 


aex + bgx + afz + bhz eae, 
cextdgxt+cfzt+dhz cey+dgy+cfw+dhw 


ll 


c(ex+fz)+d(gx+hz) cley+fw)+d(gy+hw) 


Il 


Nee oe 
exthz gyt+hw 


( 
( 
( 
oe + fz) + D(gxthz) aley + fw) + b(gy + a) 
: 
( 


ia aes)) 


We are all familiar with the fact thatx + 0 =O+x=xandx-1= 
1 - x = x for all real numbers x. Thus any number added to 0 is itself and 
any number multiplied by 1 is itself. Because of these special properties 
of 0 and 1, they are called identity elements with respect to addition and 
multiplication, respectively. 

Not all sets with binary operations have such special elements but many 
of them do. We make the following definition. 


Definition 4.1.3 If * is a binary operation of A, an element e of A is 
an identity element of A with respect to *ifa*e=e*a=a,VaEA. 


If a set A has an identity element with respect to a binary operation *, 
we will sometimes just say that * has an identity element if it is clear from 
the context what the underlying set A is. Strictly speaking, of course, it is 
the set A together with its binary operation that has the identity element. 
In addition, we will often just say identity instead of identity element. 


The identity element for Z, Q, and R with respect to addition is 0, as we 
just noted, and with respect to multiplication is 1. 

There is no identity for subtraction, though. The proof will be by con- 
tradiction. Suppose then that e is an identity for subtraction on Z. Then 
e=e—-0=0-e= ~—e, so 2e = 0, implying that e must be 0. But 0 doesn’t 
work: 0 — 1 # 1 — 0, for example. (Question: What about division on 
Q — {0} or R — {0}? See Exercise 6.) 
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We often use contradiction to show that a set has no identity with respect 
to a given binary operation; that is, we suppose that it has an identity and 
derive a contradiction. Here is another example of that technique. 


Consider the binary operation a * b = 2a + b on Z defined in Example 3. 
Suppose that Z has an identity e with respect to *. Then e * 1 = 2e + 
1 = 1, which implies 2e = 0 ore = 0. But1 *0 =2+0=2 41,500 
cannot be the identity. Therefore Z has no identity with respect to *. 


Let * be the binary operation on Z defined in Example 4, a * b =a + 
b + 1. We will show that Z has an identity with respect to *. First we have 
to discover what the identity is. If e is the identity, then for all a in Z, we 
have a*e=a+e+1 =a, from which it follows that e must be —1. It is 
easy to verify that a * (-1) = (-1) «a = a for alla € Z. 


Fortunately for our terminology, the identity function i, is an identity element 
for composition on F(A). This fact is a consequence of Proposition 3.3.3. 


Let A be a nonempty set. In Example 6, we defined the binary operation * 
on P(A) by X* Y= XN Y if X and Y are subsets of A. This binary opera- 
tion has an identity, namely A itself, since for any X € P(A), XNA = 
ANX=X. 


In Example 7 we defined two binary operations on F(R), addition and 
multiplication of functions. Each of these binary operations has an identity. 
It is easy to see that the identity for addition is the constant function that 
maps every real number to 0 and the identity for multiplication is the constant 
function that maps every real number to 1. 


0 0 
Addition on M,(R) has an identity, the matrix ( ): The identity on 
M,(R) with respect to multiplication is not so clear. It is the matrix 


1 
e ) We leave the justification of this as an exercise. (See Exercise 9.) 


You may have noticed that we occasionally used the definite article here: 
we talked about “the” identity element. This really isn’t justified until we 
prove the next proposition. 


If * is a binary operation on A with identity element e, then e is unique. 
Again, a preliminary word about the method of proof. We are assuming 


the existence of one object, an identity, and must show that there are no 
others. The conclusion is not existentially quantified; we will not exhibit an 
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identity. Rather, a clever and common way to prove uniqueness is to assume 
that there is another such object, another identity, and then prove that in 
fact the two must coincide; thus there is really only one. 


PROOF: Suppose that e’ is also an identity element. Then e * e’ = e, since 
e € A. But on the other hand, e is an identity element, so we also have e * 
e’' = e’. Hence e = e’. © 


Inverses 


Another important concept associated with a binary operation is the notion 
of the inverse of an element. 


Definition 4.1.5 Suppose that * is a binary operation on A with identity 
e, and let a € A. We say that a is invertible with respect to * if there 
exists b € A such thata*b = b * a =e. If b exists, we say that b is an 
inverse of a with respect to *. 


Inverses may or may not exist. Every element x of Z has an inverse with 
respect to addition, namely its negative —x. However, very few integers have 
multiplicative inverses; in fact, only 1 and —1 do. (This fact is proved in 
Section 5.1.) For example, the reason that 2 does not have a multiplicative 
inverse is that 1/2 is not in Z. 

But each element of Q or R, except for 0, has a multiplicative inverse— 
its reciprocal. 


Consider again the binary operation a * b = 2a + b on Z defined in Example 
3. We’ve seen that this binary operation does not have an identity. Since 
the definition of the inverse of an element presupposes the existence of an 
identity, it follows that no element of Z has an inverse with respect to *. 


Let * be the binary operation on Z defined by a* b =a + b + 1. We have 
seen that —1 is the identity of Z with respect to this binary operation. 

Let a © Z. If a has an inverse x with respect to *, thena*x=at+x+t+ 
1 = —1. Solving for x gives 


x=-a- 2. 

One easily checks that 

a*(-a-— 2) =(-a-2)*a=—-1. 

Therefore every element of Z has an inverse with respect to *. 


Let A be a nonempty set. As we have seen, composition is a binary operation 
on F(A) and i, is the identity element. It is easy to see that the elements of 
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F(A) that have inverses with respect to composition are precisely the invert- 
ible functions in F(A). (See Definition 3.3.7.) And these functions are exactly 
those functions in F(A) that are bijective. (See Theorem 3.3.8.) 


Let A be a nonempty set. Consider again the binary operation defined on 
P(A) by X * Y= X OY. We have seen that A is the identity element of 
P(A) with respect to *. Since A * A = AN A = A, it follows that A is its 
own inverse. (In fact, an identity element is always its own inverse.) 

However, if X € P(A) and X # A, then ¥ N Y #A for all Y € P(A) 
since X¥ 1 Y C X. Therefore if X € P(A) and X # A, X does not have an 
inverse with respect to *. 


Every element of F(R) has an inverse with respect to addition of functions: 
namely, its negative —f. For example, the inverse of the function f: R > R 
defined by f(x) = 2x? — 3x + cos x is the function —f: R — R defined by 
(—f)(x) = —2x3 + 3x — cos x. 

For multiplication in F(R), the situation is a bit more complicated. If a 
function f € F(R) takes on the value 0 at even one point, say, f(x.) = 0, 
then no function multiplied by f(x) can equal 1 at x). Hence the functions 
in F(R) that have inverses are those that are never 0. The inverse of such 
a function is its reciprocal. For example, the multiplicative inverse of the 
function f: R — R defined by f(x) = x? + 1 is the function 1/f: R — R, 
defined by 


(jin; 


(Note: We use the symbol 1/f for the multiplicative inverse of a function and reserve 
the symbol f~! for the inverse with respect to composition.) 


Not every element of /,(R) has an inverse. In fact, the invertible elements 


a b 
of M,(R) are precisely those matrices ( for which ad — bc # 0. (See 


c 
Exercise 11.) 


In some cases, inverses have a uniqueness property like identities do. 
The result is stated next and the proof is left as an exercise (Exercise 12). 


Let * be an associative binary operation on a set A with identity element e. 
If a € A has an inverse with respect to *, then that inverse is unique. 


If the inverse of an element a € A is unique, we denote it by a“! and 
call it the inverse of a with respect to *. 
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Closure 


A subset of a set with a binary operation “inherits” a way to combine pairs 
of elements: they are, after all, elements of the larger set, and the operation 
can be applied. But where is the result? There is no guarantee that it is in 
the subset. We single out the case when in fact we have an operation on 
the subset. 


Definition 4.1.7 Let * be a binary operation on the set A, and suppose 
that X C A. If * is also a binary operation on X (that is, if x * y © X, 
Vx, y © X), then X is said to be closed in A under +. 


Note that the definition of closure is a statement with a universal quan- 
tifier. 


Z* is closed in Z under addition and multiplication. This fact is one of the 
axioms of the integers and will be discussed in Section 5.1. 


The set N = {n € Z| n < 0}, the set of negative integers, is closed in Z 
under addition, but not under multiplication. (The formal proof of these 
facts will be an exercise in Section 5.1.) 


Let E = {n € Z| n is even}. E is closed in Z under both addition and 
multiplication. Here is a proof for the closure of addition. 
Let a, b € E. Then a = 2a, and b = 2b, where a,, b, € Z. Hence 


a+ b = 2a, + 2b; = 2(a + by). 


Since a, + b} © Z,a + b = 2(a, + b,) EE. 
The proof for closure of multiplication is left as an exercise. 


Let A be a set with a binary operation *. Let X be a subset of A. To 
prove that X is not closed under *, we need to express the negation of 
the definition of closure. Let P(x, y) be the open sentence “‘x, y © X”’ and 
Q(x, y) be the open sentence ‘‘x * y © X.” Then X is closed under * if 
Vx, y G A, P(x, y) > O(%, y). Thus X is not closed under * if there exist 
x, y © A such that P(x, y) is true and Q(x, y) is false. In other words, 4x, 
y © A such thatx, ye Xbutx*+yEX. 


The set O = {n € Z | n is odd} is closed in Z under multiplication but not 
under addition. We leave the proof that O is closed under multiplication as 
an exercise. (See Exercise 16.) 

To see that O is not closed under addition, we need only find integers x 
and y such that x and y are odd but x + y is even. One example is x = 3 
and y = 5. 
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Let C(R) = {f € F(R) | fis continuous on R}. Then C(R) is closed in F(R) 
under both addition and multiplication. This follows from a result of calculus 
that says that the sum and product of continuous functions are continuous. 


Let X = {f € F(R) | f(x) € Z, Vx © R}. This is the set of integer-valued 
functions in F(R). X is closed in F(R) under composition. 

To see this, let f g © X; we choose x € R, and we must show that 
fg(x) © Z. But fg(x) = f(g(x)) € Z, since g(x) EZ CR. 


ab 

Let S = {( ’) € M,(R) |a= i} We will show that S is closed under 
Cc 

addition but not closed under multiplication. 


a b x y 
Let ( \. ) © 5. Then a = x = 0, Hence 
c a ZW 


a b x y 0+0 bty 0) bt+y 
+ = = ES. 
c d Zw c+z d+tw c+z dt+w 
Therefore, S is closed under addition. 


To prove that S is not closed under multiplication, we merely have to 
show that there exist two elements of S whose product is not in S. Two such 


01 0 O 
elements are ( and ( ) 
1 0 -1 0 


@® MATHEMATICAL PERSPECTIVE: GROUPS 


In a number of the examples of binary operations * on a set A done in this 
section, the operation was associative, the set A had an identity with respect 
to *, and every element of A had an inverse. Mathematicians have given a 
name to any set A with this property. It is called a group. The notion of 
group was mentioned informally in the comments at the end of Section 3.2. 

The set Z of integers is a group with respect to the binary operation of 
ordinary addition. Z is also a group with respect to the binary operation 
defined in Example 4. It is not a group with respect to the binary operation 
in Example 3, however, because that binary operation is not associative. Z 
is also not a group with respect to the binary operation of multiplication 
because, although multiplication is associative and there is an identity, only 
1 and —1 have inverses with respect to multiplication. 

Now consider the set F(A) of all functions from a nonempty set A to 
itself. We saw in Example 5 that composition is a binary operation on F(A) 
and composition is associative. Moreover, the identity function i, is the 
identity element of F(A) with respect to composition. However, not every 
function in F(A) has an inverse with respect to composition. In fact the ones 
that do are the bijective functions from A to A. 
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To get a group from this set of functions, we focus on a subset of F(A). 
Let S(A) be the set of all bijective functions from A to A. Then S(A) is 
closed in F(A) with respect to composition. (See Exercise 22 following.) This 
means that composition is a binary operation on S(A). It is an associative 
binary operation on S(A), since the associative property is inherited from 
F(A); that is, (fg)h = f(gh) for all f, g, h in S(A) because (fg)h = f(gh) for 
all f g, h in F(A). The identity function i, is bijective and is therefore in 
S(A). Thus S(A) has an identity with respect to composition. Finally, all of 
the elements of S(A) have inverses with respect to composition because they 
are invertible functions. Moreover, these inverses are themselves invertible 
and therefore in S(A). 

Thus S(A) is a group with respect to composition. This type of group is 
called a permutation group, because the elements of S(A) are permutations 
of the set A. We discussed some examples of permutation groups at the end of 
Section 3.2. Groups appear in many areas of mathematics including number 
theory, geometry, and topology. The study of groups, called group theory, 
has been a thriving area of mathematical research for many years. 


Exercises 4.1 


1. (a) Prove that division is a binary operation on R — {0}. 
(b) Prove that division on R — {0} is neither associative nor commutative. 


2. Determine whether each of the following binary operations is associative 
and/or commutative. Prove your answers. 
(a) OnZn*m=n+m+2 
(b) On Zn*m=6+nm 
(c) On Z,n* m = nm? 
(d) On Z*, n * m = max(n, m), the larger of n and m 
(e) On Z*, n * m = min(n, m), the smaller of n and m 
(f) On P(A), for any set A, X¥* Y=XUY 
(g) On R, x * y = 2” 
(h) On Z*,n * m =n" 
3. (a) Let A be a nonempty set. Prove that if A has more than one element, 


then composition on F(A) is not commutative. 
(b) Prove that if |A] = 1, then composition on F(A) is commutative. 


4. (a) Prove that addition on F(R) is commutative. 
(b) Prove that multiplication on F(R) is associative. 
(c) Prove that multiplication on F(R) is commutative. 


5. (a) Prove that addition on M,(R) is associative. 
(b) Prove that multiplication on M,(R) is not commutative. 


6. Does division on R — {0} have an identity element? If so, what is it? 


7. For each of the binary operations in Exercise 2, determine which ones 
have identities. Prove your answers. 


4.1 


10. 


11. 


12. 
13. 


14. 


15. 


16. 


17. 


18. 


19. 
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. Compute: 


Ole ua) Ole glen.) 


1 0 
. Prove that the matrix (‘ ‘) is the identity of M,(R) with respect to 


multiplication. 


For each of the binary operations in Exercise 2 that has an identity, 
determine which elements of the underlying set have inverses. 


a b 
Prove that ( is an invertible element of M,(R) if and only if 


c 
ad — bc #0. 


Prove Proposition 4.1.6. 


Let * beanassociative binary operation onasetA withidentity elemente. 

(a) Prove that e is invertible. 

(b) Prove that if a € A is invertible, then a™! is invertible and 
(a')1 =a. 


Let * beanassociative binary operation ona set A with identity element 

e, and leta,bE A. 

(a) Prove that if a and b are invertible, then a * b is invertible. 

(b) Prove that if A is the set of real numbers R and * is ordinary 
multiplication, then the converse of part (a) is true. 

(c) Give an example of a set A with a binary operation * for which the 
converse of part (a) is false. 


Consider the statement P: “Every commutative binary operation is asso- 
ciative.” 

(a) What is ~P? 

(b) Precisely one of the statements P and —P is true. Prove it. 


(a) Prove that the set E of even integers is closed in Z under multipli- 
cation. 

(b) Prove that the set O of odd integers is closed in Z under multipli- 
cation. 


Let S be the set of nonzero real numbers. 
(a) Is § closed under addition? Prove your answer. 
(b) Is S closed under multiplication? Prove your answer. 


Let n € Z and let nZ be the set of multiples of m nZ = {a € Z| a = 
nt for some t € Z}. Prove that nZ is closed in Z under ordinary addition 
and multiplication of integers. 


Let * be the binary operation on Z defined by a * b = a + 2b. Prove 
or disprove that each of the following subsets is closed in Z under *. 
(a) The set E of even integers 
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20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


(b) The set O of odd integers 
(c) The set nZ of multiples of a fixed integer n 


Let * be the binary operation on Z defined by a * b =a + b + 1. Prove 
or disprove that each of the following subsets is closed in Z under *. 
(a) The set E of even integers 

(b) The set O of odd integers 

(c) The set nZ of multiples of a fixed integer n 


Let D(R) = {f € F(R) | f is differentiable on R}. Prove that D(R) is 
closed in F(R) under addition and multiplication. (You may use without 
proof any result from calculus you need, but state that result carefully.) 


Let A be a nonempty set. Determine whether or not the following sets 
are closed in F(A) under composition. Prove your answers. 

(a) {f € F(A) | f is injective}. 

(b) {f € F(A) | f is surjective}. 

(c) {f € F(A) | f is bijective}. 


Determine whether or not the following sets are closed in F(R) under 
addition. Prove your answers. 

(a) {f € F(R) | f(0) = 0}. 

(b) {f € F(R) | f(0) = 1}. 

(c) {f © F(R) | f’(0) = 0}. 

(d) {f € F(R) | f(—x) = f(x) for all x € R}. 

(e) {f © F(R) | f(—x) = —f(x) for all x € R}. 

Determine whether or not the sets in Exercise 23 are closed in F(R) 
under multiplication. Prove your answers. 


Determine whether or not the sets in Exercise 23 are closed in F(R) 
under composition. Prove your answers. 


Determine whether or not the following sets S are closed in M,(R) under 
addition. Prove your answers. 


(a) S= {(? ’) E MR) |b -o. 
(b) S= {(° ’) € M,(R) |c = i} 


(c) S= {(° ’) € M,(R)|a= a} 


(d) s={(" *)emcn) 3a +45 - To + d=}, 


Determine whether or not the following sets S are closed in M,(R) under 
multiplication. Prove your answers. 


(a) S= {(? ’) E M,(R) | b = of. 


4. 


28. 


29. 


30. 


31. 


32. 


33. 
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(b) S= {(" |) E M,(R)|d = of. 


a b 
(c) S= ( ) © mse) |a= a}. 
cd 
ab 
(d) S= ( Jem) |ascaez} 
c od 
A matrix A in M,(R) is called a nonzero matrix if A is not the matrix 


0 0 
(, | Let S be the statement ‘‘The product of two nonzero matrices 


in M,(R) is a nonzero matrix.” 

(a) Write S in the form: ‘For all... if...then....” 
(b) Write the negation of S. 

(c) Prove or disprove S. 


Let * be an associative binary operation on a set A with identity element 
e. Let B = {a € A | a is invertible with respect to *}. Prove that B is 
closed in A under *. 


Let * be an associative binary operation on a set A with identity element 

e. Suppose a, b, c € A and that a is invertible. 

(a) Prove that ifa*b=a*c, thenb=c. 

(b) Give an example to show that this ‘‘cancellation law” is not true if 
a is not invertible. 


Let « be a binary operation on a set A. Complete the following state- 
ments: 

(a) * is not associative if.... 

(b) * is not commutative if .... 

(c) x € A is not an identity element of A if.... 

(d) a € A does not have an inverse if... . 


Let « be a binary operation on a set A. Let B and C be subsets of A 

that are closed under *. 

(a) Prove that B M C is closed under *. 

(b) Give an example to show that B U C need not in general be closed 
under *. 


Let A be a nonempty set. In Example 6, we defined the binary operation 
* on P(A) by X * Y = X 1 Y if X and Y are subsets of A. Let X¥ € 
P(A). Since X * X = X N X = X, does this mean that X is the identity 
for this binary operation? Explain. 


Let * be an associative binary operation on a set A. Suppose that A has 
an identity e with respect to *. Leta € A. We say that a has a left inverse 
with respect to * if there exists b € A such that b * a = e. Similarly, we 
say that a has a right inverse with respect to * if there exists c € A such 
thata*c =e. 
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35. 


36. 


37. 


(a) Prove that if an element a € A has a left inverse 5 and a right inverse 
c, then a is invertible and a! = b =. 

(b) Suppose that a € A has a left inverse b and that b is not a right inverse 
for a. Prove that a has no right inverse and that a is not invertible. 

(c) Give an example of a set with an associative binary operation having 
an identity for which there is an element with a left inverse but no 
right inverse. Hint: See Exercise 2 of Section 3.3. 


Prove that the set M,(R) is a group with respect to matrix addition but 
is not a group with respect to multiplication. 


a b 
Prove that the set {( ’) € M,(R) | ad -— bc # o} is a group with re- 
Cc 


spect to matrix multiplication. 


res={(* “\jaer}. 


(a) Prove that S is closed in M,(R) with respect to matrix multiplication. 

(b) Prove that S has an identity with respect to matrix multiplication. 

(c) What elements of S have inverses with respect to matrix multiplica- 
tion and the identity you found in part (b)? Prove your answer. 

(d) Prove that the set T of nonzero matrices in S forms a group with 
respect to matrix multiplication. 


Discussion and Discovery Exercises 


D1. 


D2. 


D3. 


Let * be an associative operation on a set A. Explain how the definition 
of invertible element implies that if an element a in A is invertible, then 
a”! is invertible and (a!) = a. 


Let « be a binary operation on a set A. Suppose that A has an identity 
e with respect to *. Let B be subset of A and suppose that B is closed 
in A with respect to *. Does it follow that B must have an identity with 
respect to *? Must that identity, if it exists, be e? Explain with proofs 
and examples. 


Consider the set M,(R) with respect to the binary operation of matrix 
multiplication. We say that a matrix A has finite order if some power 


1 0 
of A is the identity matrix (‘ ) Specifically, we say that A has order 


1 0 
nif A* = ( ) where n is a positive integer and n is the smallest 


positive integer with this property. For example, the matrix 


-1 0 -1 0O,\ 1 0 
( ) has order 2, since ( ) = ( : 
0 -il 0 -i1 0 1 


4.2 


4.2 EQUIVALENCE RELATIONS 139 


D4. 


DS. 


(a) Find a matrix of order 4. 
Hint: The entries of the matrix will just involve the integers 0, 1, and —1. 


(b) Find a matrix of order 3. 
Hint: The entries of the matrix will not be integers. 


(c) Find a matrix that does not have finite order. 


Consider the set M,(R) with respect to the binary operation of matrix 
1 0 
multiplication. Let J denote the identity matrix [ a It is shown in 


linear algebra that if a 2 X 2 matrix A has a left inverse B then A is 
invertible and B = A™!. That is, if there exists a 2 X 2 matrix B such 
that BA = J, then A is invertible and B = A”. (See Exercise 34.) 

(a) Look up the proof of this result in a linear algebra text and write 


a clear explanation of the proof. 
Hint: The result is usually stated for n X n matrices and involves the connec- 
tion between an n X n matrix being invertible and the solution of n X n 
linear systems. 

(b) Prove that if a 2 X 2 matrix A has a right inverse C then A is 


invertible and C = A™!. 
Hint: This problem can be done just using part (a) and Exercise 13(b) of 
this section. 


In Proposition 4.1.6, we proved that if * is an associative binary operation 

on a set A with identity element e and if a € A has an inverse with 

respect to *, then that inverse is unique. The proof uses the fact that * 

is associative. The following example is cooked up to show that for 

some binary operations, an element may have more than one inverse. 
We define a binary operation on R as follows. If a, b € R, let 

a*xb=a+b-— (aby. 

(a) Prove that * is commutative but not associative. 

(b) Find the identity element of R with respect to *. 

(c) Prove that 1 has two inverses with respect to *. 

(d) Prove that any real number a has at most 2 inverses with respect to *. 

(e) Which real numbers have no inverse with respect to *? Which ones 
have exactly one inverse? Which ones have two inverses? 


EQUIVALENCE RELATIONS 


One of the most basic concepts of mathematics, one that we are all familiar 


with from our earliest days in a math class, is equality. But this very strict 


idea of two objects being equal is often not what we mean when we use the 


word in everyday life. For example, baseball fans throughout America can 


be classified by their team allegiance: there are Red Sox fans, Cubs fans, 
and so on. From the point of view of a team trying to sell tickets, any two 
of its fans (with money!) can be considered equivalent, although they are 
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certainly not the same person. This ‘“‘equality”’ is a generalization of the 
strict notion, considering two objects equivalent because they share some 
property, or are in some relation to each other. In this section we try to 
capture this generalization mathematically. 


Relations 


Definition 4.2.1 A relation R on a set A is a subset of A X A. If 
(a, b) € R, we write aRb. 


The set R = {(0, 0), (0, 1), (2, 2), (7, 18)} is a relation on the set Z*. We 
would thus write ORO, OR1, 2R2, and 7R18. 


Let R = {(a, b) € Z X Z| b = na for some n € Z}, a relation on the set Z 
of integers. Then, for example, 2R4, (—3), R(—42), and 7RO. In general, aRb 
if b is a multiple of a. 


On the set of baseball fans, define aRb if a and b are fans of the same team. 


We can define a relation R on the set of all people by aRb if persons a and 
b have at least one parent in common. 


A different relation S on the set of all people can be defined by aSb if 
persons a and b are siblings; that is, they have both parents in common. 
Notice that S is a subset of the relation R in Example 4. In fact, it’s a proper 
subset, since for example Elizabeth I and Edward VI of England were both 
children of Henry VIII, but had different mothers (with different fortunes). 


Define a relation R on the set R of real numbers by aRb if a < b. So for 
example, 5R5.5 and 7R4. 


Let S be the set of all finite subsets of Z. Define R on S by ARB if |A| = |B]. 


Properties of Relations 


The following definition lists four important properties that a relation may 
have. 


Definition 4.2.2 Let R be a relation on a set A. We say: 


1. R is reflexive if aRa, Va € A. 

2. R is symmetric if for all a, b € A, if aRb, then bRa. 

3. R is transitive if for all a, b, c € A, if aRb and bRc, then aRc. 

4. R is antisymmetric if for all a, b € A, if aRb and bRa, then a = b. 
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Let R be the relation of Example 1. Then R is not reflexive because 1 is not 
related to 1. 

R is not symmetric because 0R1 but 1 is not related to 0. 

R is transitive since the only instance in which aRb and bRc is when 
a = 0, b = 0, and c = 1, and in this case aRc. 

R is antisymmetric because there are no integers a and b such that aRb 
and bRa anda # b. 


Let R be the relation on Z defined by aRb if b = na for some integer n, as 
in Example 2. 

Let a € A. Since a = 1a, aRa. Therefore R is reflexive. 

R is not symmetric since, for example, 2R4 but 4 is not related to 2. 

R is transitive. Let a, b, c © Z and suppose that aRb and bRc. Then 
b = na for some n € Z and c = mb for some m € Z. Hence c = mna and 
because mn is an integer, it follows that aRc. 

R is not antisymmetric since 2R(—2) and (—2)R2 but 2 # —2. 


It follows from the order properties of the real numbers (see Section 5.1) 
that the relation R on the set R of real numbers defined by aRb if a < b is 
transitive but not reflexive or symmetric. 

This relation may not seem to be antisymmetric since for any real numbers 
a and b, we cannot have a < } and b < a. But it is for precisely this reason 
that it is antisymmetric. The definition of antisymmetric requires that for all 
a, b E R, the implication a < b and b < a imply a = b. But remember that 
an implication P => Q is true whenever the premise P is false. In this case 
the premise a < 5 and b < ais always false. Therefore for every choice of 
real numbers for the variables a and b, the implication a < b andb <a 
implies that a = b is true. Hence R is antisymmetric. 


Note that to prove a relation on a set A symmetric requires proving a 
statement with a universal quantifier and an implication. For every a, b in 
A, it is necessary to prove that if aRb, then bRa. Similarly, the transitive 
property is a statement with a universal quantifier and an implication. For 
every a, b, c in A, if aRb and bRc, then aRc. The same is true for the 
antisymmetric property. The reflexive property however has a universal 
quantifier but does not contain an implication. It is merely necessary to show 
that aRa for alla € A. 

Relations that have the properties of being reflexive, symmetric, and 
transitive are particularly important. 


Equivalence Relations 


Definition 4.2.3 A relation R on aset A is called an equivalence relation 
if it is reflexive, symmetric, and transitive. 
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On any set A, the relation of equality, aRb if a = b, is clearly an equivalence 
relation. We would certainly hope this were true, since we are attempting 
to generalize this very notion! But there are many other relations that can 
be seen to satisfy the three properties of the definition. 


Consider again the set of baseball fans, and the relation defined in Example 
3. Let’s assume, quite reasonably, that no one can really be a fan of two 
different teams. Then it’s easy to see that the three properties are satisfied. 


On Z, define aRb if |a| = |b|, so for example, 2R(—2) and 4R4. 

Since |a| = |a|, Va € Z, R is immediately seen to be reflexive. 

To see that R is symmetric, let a, b © Z and suppose that aRb. Then 
|a| = |b|, which implies that |b] = |a| and thus bRa. 

Now suppose that a, b, c € Z and that aRb and bRc. Then |a| = |b| and 
|b| = |c| and hence |a| = |c|. Therefore aRc, proving that R is transitive. 


For a less trivial example, consider the relation R on Z defined by aRb if a 
and b are either both even or both odd. Because every integer is either even 
or odd, aRa, Va € A, and R is reflexive. Symmetry is obvious. Finally, 
suppose that a, b, c € Z and aRb and bRc. Then if b is even, so are a and 
c, and we have akc. Similarly, if b is odd, we get aRc. So R is transitive. 


The relation R in the previous example could be defined in a different way 
by recognizing that aRb if a — b is even. An integer is even if and only if 
it is divisible by 2. This insight leads us to try the same thing with an integer 
other than 2, say 3. 

Define aRb if a — b is divisible by 3; that is, a — b = 3k, for some 
k © Z. We must check the three properties to prove that R is an equiv- 
alence relation. 

Reflexive: Ifa € Z,a -a =0=3:-0,soaRa. 

Symmetric: If aRb, then a — b = 3k, for some k € Z. But then b — a = 
3(—k), and so bRa. 

Transitive: If aRb and bRc, then a — b = 3k,, b — c = 3k,, for some k,, 
k, € Z. Thena — c = (a — b) + (6 — c) = 3k, + 3k, = 3(k; + k,), so 
aRc. 


You may have noticed that the number 3 played no important role in the 
previous argument, and could just as easily be replaced by any positive 
integer. Therefore, for n € Z*, we define a relation R on Z, called congruence 
mod n, by aRb if a — b = nk for some k € Z. This is usually written a = 
b(mod n). You are asked in the exercises to check that congruence mod n 
is in fact an equivalence relation. We will see a good deal more of congru- 
ence later. 
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When R is an equivalence relation, it is common to write a ~ b instead 
of aRb, read as “‘a is equivalent to b.”” As we mentioned before, we want 
to generalize equality and view two elements related by an equivalence 
relation as ‘‘equivalent.” If we group related elements together, we get what 
is known as an equivalence class. 


Equivalence Classes 


Definition 4.2.4 If R is an equivalence relation on A, and a € A, the 
set [a] = {x € A |x ~ a} is called the equivalence class of a. Elements 
of the same class are said to be equivalent. 


The following examples refer to the six examples of equivalence relations 
(11-16) previously given. 


Since in any set A the only element equal to an element a is itself, [a] = {a}. 


The class of a single baseball fan is simply the set of all fans who root for 
the same team as that person. Thus, for example, [John Updike] is the set 
of all Red Sox fans. 


Every nonzero element of Z can be associated with exactly one other element, 
its opposite, with the same absolute value. Thus, [a] = {a, —a}. Notice that 
every class has two elements except [0]. 


Even numbers are equivalent, as are odd numbers. Hence there are two 
classes: the set of even integers E and the set of odd integers O. 


Integers with the same remainder are equivalent, so there are three classes: 


Notice that we could also have called these classes [6], [—5], and [20]. 


This example is similar to the previous one, but there are n possible remain- 
ders; namely, 0, 1, 2,... — 1. Thus there are n equivalence classes, and 
we usually write them as [0], [1], ... [a — 1]. 

You may have sensed that an equivalence relation divides the set in 
question into classes; we have distinct collections of Cubs fans, of Dodgers 
fans, and so on. In fact, the set of baseball fans is partitioned into sets with 
allegiance to different teams, as the following theorem shows. 
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If R is an equivalence relation on a nonempty set A, then the set of equiva- 
lence classes of R forms a partition of A. 


PROOF: Recall that a partition of the set A is a collection of nonempty 
subsets of A such that the union of these sets equals A and no two of the 
sets intersect. Since each equivalence class is a subset of A, so is the union 
of these equivalence classes. To prove that the union of the equivalence 
classes equals A, we must show that every element of A is in some equivalence 
class. But if a € A, then a ~ a since R is reflexive, so a € [a]. Note that this 
also shows that every equivalence class is a nonempty set. 

Now we must also show that different equivalence classes do not intersect. 
Suppose, on the contrary, that there exist distinct equivalence classes [a] and 
[b] such that [a] M [b] # ©. Let x € [a] MN [b]. Then, in particular x ~ a, 
and since R is symmetric, a ~ x. But also x ~ b since x € [b]. Since R is 
transitive, it follows that a ~ b. Therefore a € [b]. Now take an arbitrary 
y € [a], and notice that since y ~ a and a ~ b, we have y ~ b and hence 
y € [b]. We have shown that in fact [a] C [b]. However, we could start this 
same argument with y € [b] and conclude that [b] C [a], and so [a] = [Db]. 
This is a contradiction to our assumption that [a] # [b], so we can conclude 
that [a] N [b] = ©. 


The previous theorem shows that any equivalence relation on a set A 
leads to a partition of A. The next theorem shows that any partition of a 
set A gives rise to an equivalence relation on A. 


Let Y be a partition of a nonempty set A. Define a relation R; on A by 
aR,b if a and b are in the same element of the partition. Then R; is an 
equivalence relation on A. 


PROOF: If a € A, then a € X for some X € ¥ so clearly aR,a. It’s equally 
obvious that if aR,b, so that a, b © X, for some X € Y then we can just as 
well say b, a © X, and bR,a. Finally, if aR,b and bR\c, we have a, b © X 
and b, c € Y, for some X, Y € & But since is a partition of A and b € 
X 1 Y # ©, we must have X = Y. Therefore a, c € X, and aR,c. Thus R; 
is reflexive, symmetric, and transitive, and hence an equivalence relation. ** 


Theorems 4.2.5 and 4.2.6 are related in the following way. Let R be an 
equivalence relation on the nonempty set A. Theorem 4.2.5 says that the set 
Pf of equivalence classes of R is a partition of A. Theorem 4.2.6 says that 
this partition gives rise to an equivalence relation R, of A. From the definition 
of this equivalence relation given in the proof of Theorem 4.2.6, it follows 
that R; = R. 

Similarly, if we start with a partition 7 of A, then there is a corresponding 
equivalence relation R,; by Theorem 4.2.6. Theorem 4.2.5 says that there 
is a partition determined by this equivalence relation, namely, the set of 
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equivalence classes of R,. But this set is just the partition 7 that we 
started with. 

These remarks tell us that there is a bijection between the set of all 
equivalence relations of A and the set of all partitions of A. 


Partial and Linear Orderings 


The relation R on the real numbers R defined by aRb if a = 5 is not an 
equivalence relation because it is not symmetric. However, R is reflexive, 
transitive, and antisymmetric. It is an important relation because it gives an 
ordering of the real numbers: it defines what it means for one number to be 
“Jess than’ another. The notion of partial ordering is a generalization of 
this example. 


Definition 4.2.7 A relation R on a set A is called a partial ordering on 
A if R is reflexive, transitive, and antisymmetric. 

If A is a set and there exists a partial ordering on A, then we say 
that A is a partially ordered set. 


As just noted, the relation = is a partial ordering on R. It is also a partial 
ordering on Z. Note, however, that the relation < is not a partial ordering 
on R. 


Let A be a set and P(A) be the power set of A. The relation R on P(A) 
defined by XRY if X C Y is a partial ordering on P(A). We leave the 
verification as an exercise. 


The partial ordering = on R has the property that for all a, b © R, 
either a = b or b = a. Any partial ordering with this property is called a 
linear ordering. 


Definition 4.2.8 Let A be a set and R be a partial ordering on A. We 
say that R is a linear ordering on A if for all a, b € A, either aRb or bRa. 

If A is a set and there exists a linear ordering on A, then we say that 
A is a linearly ordered set. 


As we just saw, the relation = is a linear ordering on R. 


Let A be a set and P(A) be the power set of A. The relation R on P(A) 
defined by XRY if X C Y is not a linear ordering on P(A) except in the 
special cases where A is the empty set or A contains only one element. 


146 


CHAPTER 4 BINARY OPERATIONS AND RELATIONS 


“=P HISTORICAL COMMENTS: THE RATIONAL NUMBERS 


After the theory of irrational numbers was firmly established by Weierstrass, 
Dedekind, and Cantor (see Section 2.2) among others, a formal definition 
of the rationals was undertaken, with the properties of the integers assumed 
as known. This definition gives us a nice application of equivalence relations. 

Assuming the existence of the set of integers Z, we look at the set Z X 
Z of ordered pairs of integers. We can think of a rational number as an 
ordered pair (a, b) of integers and intuitively associate the pair (a, b) with 
the fraction a/b. So, for instance, the fraction 1/2 is the ordered pair (1, 2), 
the fraction 2/3 is the ordered pair (2, 3), and so on. The denominator of a 
fraction cannot be zero, of course, so we consider just the set S = {(a, b) € 
Z X Z| b ¥ O}. Thus we define the rational numbers using only a subset of 
ZX Z. 

We know, however, from our introduction to fractions in grade school 
that a rational number has more than one representation. For example, 
1/2 = 2/4 = 3/6 = 12/24, 2/3 = 4/6 = 12/18, and so forth. Hence the ordered 
pairs (1, 2) and (3, 6) represent the same rational number. For the purposes 
of a formal definition of the rational numbers, then, we need to express 
certain ordered pairs such as (1, 2) and (3, 6) as one element. We do this 
by defining a relation on the set S of ordered pairs of integers (a, b) with 
b # 0. How should such a relation be defined? Well, we need to look at the 
reason why the fractions 1/2 and 3/6 are equal and generally why any two 
fractions a/b and c/d are equal. An easy algebraic calculation tells us that 
a/b = c/d if and only if ad = bc. Therefore, we define a relation R on S by 
(a, b)R(c, da) if ad = bc. The next step is to verify that R is an equivalence 
relation on S. We leave the proof as an exercise. (See Exercise 18.) 

Once we know that R is an equivalence relation, we can look at the set 
of equivalence classes. If (a, b) € S, for simplicity of notation, we write 
(a, b] for the equivalence class of (a, b), instead of [(a, b)]. Thus [a, b] is the 
set of all ordered pairs (x, y) where y # O and ay = bx. For example, 
(1, 2] = {(1, 2), (2, 4), 3, 6), (4, 8), (—5, -10), ... }. So all of the differ- 
ent representations of the fraction 1/2 are united in one element, the class 
{1, 2]. Finally, we can define the set of rational numbers Q as the set of all 
equivalence classes [a, 5]. 

We learn early in grade school the rules for adding and multiplying 
fractions. Using these rules as a model, we can give a formal definition of 
addition and multiplication on Q. 

If [a, b], [c, d] € Q, we define 


[a, b] + [c, d] = [ad + bc, bd] and _ [a, b] - [c, d] = [ac, bd]. 


Note that since b ¥ 0 and d # 0, then bd ¥ 0, so that [ad + bc, bd] and 
{ac, bd] are actually in Q. 

Our definitions of addition and multiplication on Q are not yet complete, 
however. They seem to depend on the choices of a, b, c, and d to represent 


42 EQUIVALENCE RELATIONS 147 


the equivalence classes. For example, using the definition of addition, 
(1, 2] + (2, 3] = [(1)(3) + (2)(2), (2)(3)] = [7, 6]. But we can also write 
{1, 2] as [4, 8] and [2, 3] as [12, 18]. Do we get the same answer if we use 
these new representations? Let’s do the computation and see: 


[4, 8] + [12, 18] = [(4)(18) + (8)(12), (8)(18)] = [168, 144]. 


But (168, 144] = [7, 6] since (168)(6) = (144)(7) = 1008. So in this instance 
at least we can use different representations of [1, 2] and [2, 3] and arrive 
at the same sum. 

Now we must verify that in all cases addition in Q does not depend 
on the representation of the equivalence classes. So suppose that we wish 
to add [a, b] and [c, d]. Let [x, y] be another representation of [a, b] 
and [z, w] be another representation of [c, d]. Then (a, b)R(x, y) and 
(c, d)R(z, w). This means that ay = bx and cw = dz. 

[a, b] + [c, d] = [ad + bc, bd] and [x, y] + [z, w] = [xw + yz, yw]. 
To show that these two answers are the same we must show that (ad + 
bc)(yw) = (bd)(xw + yz): 


(ad + bc)(yw) = adyw + bcyw 
= aydw + cwby 
= bxdw + dzby 
= bd(xw + yz). 


This completes the proof that addition on Q does not depend on the 
choices of integers to represent the equivalence classes. We say that addition 
on Q is well defined. 

We leave the proof that multiplication on Q is well defined as an exercise. 

As you might expect, addition and multiplication on Q are both associa- 
tive and commutative and Q has an identity for addition and an identity for 
multiplication. Also, every element of Q has an inverse with respect to 
addition and every element of Q except the additive identity has an inverse 
with respect to multiplication. All of these properties are left to the exercises. 


Exercises 4.2 


1. For each of the following relations on Z, determine whether it is reflexive, 
symmetric, transitive, or antisymmetric. 
(a) aRb if a> b. 
(b) aRb if a = —b. 
(c) aRb if b = Sa. 
(d) aRbifasb +1. 


2. Let A = {l, 2, 3}. Let R be the relation on A defined by aRb if a + 
b # 3. Determine whether R is reflexive, symmetric, transitive, or anti- 
symmetric. 
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10. 


11. 


. Let A be a nonempty finite set and let P(A) be the power set of A. 


Determine whether each of the following relations on P(A) is reflexive, 

symmetric, transitive, or antisymmetric. 

(a) XRY if X C Y. (Recall: The symbol C means that X is a proper 
subset of Y.) 

(b) XRY if XN Y=. 

(c) XRYiIf XN YS. 

(d) XRY if |X| = |Y|. 

(e) XRY if |X| S |Y|. 

(f) XRY if |X| <|Y|. 


. For each of the following relations, determine whether it is reflexive, 


symmetric, or transitive. For those that are equivalence relations, give 

the equivalence classes. 

(a) On F(R), fRg if f() = g(0). 

(b) On Z X Z, (a, b)R(c, d) if ad = be. 

(c) On A X B, for any sets A and B, (a, b,)R(a:, br) if ay = a. 

(d) Let A be a nonempty set and let f € F(A). Define a relation on A 
by aRb if f(a) = f(b). 

(e) OnQ,rRs Sr-sEZ. 


. Let R be the relation of congruence mod 4 on Z: aRb if a — b = 4k, 


for some k € Z. 

(a) What integers are in the equivalence class of 18? 

(b) What integers are in the equivalence class of 31? 

(c) How many distinct equivalence classes are there? What are they? 


. Repeat Exercise 5 for congruence mod 5. 
. Letn € Z*. Prove that congruence mod v is an equivalence relation on Z. 


. Let R be a relation on a set _A. Complete the following sentences. 


(a) R is not reflexive if.... 

(b) R is not symmetric if .... 

(c) R is not transitive if .... 

(d) R is not antisymmetric if .... 


. Let A be the set of all people in the United States. Let R be the relation 


on A defined by: a is related to b if a and b were born on the same day 
of the week. 

(a) Prove that R is an equivalence relation on A. 

(b) Describe the equivalence classes for R. How many are there? 


Let R be the relation on Z defined by aRb if a* — b’ is divisible by 4. 
(a) Prove that R is an equivalence relation on Z. 
(b) What are the equivalence classes for this equivalance relation? 


Let A be a nonempty set and P(A) be the power set of A. 
(a) Prove that the relation R on P(A) defined by XRY if X C Yisa 
partial ordering on P(A). 
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(b) Prove that R is a linear ordering if A has only one element and is 
not a linear ordering if A has more than one element. 


12. Let R be the relation on Z* defined by aRb if b is a multiple of a; that 
is, if there exists nm € Z such that b = na. 
(a) Prove that R is a partial ordering on Z*. 
(b) Determine whether or not R is a linear ordering on Z*. 


13. Let R be a relation on a set A. Complete the following sentence: R is 
not a partial ordering on A if.... 


14. Let R be a partial ordering on a set A. Complete the following sentence: 
R is not a linear ordering on A if .... 


15. Let * be a binary operation on a set A. Assume that * is associative and 
that A has an identity e with respect to *. Let R be the relation on A 
defined as follows: if a, b € A, then aRb if there exists an invertible 
element c € A such that b = c! * a * c. Prove that R is an equivalence 
relation on A. 


16. Let F(R) be the set of all functions f: R — R. Define a relation R on 
F(R) by fRe if f(x) S g(x) for allx ER. 
(a) Prove that R is a partial ordering on F(R). 
(b) Determine whether or not R is a linear ordering on F(R). 


17. Let R be a relation on a set A. 

(a) Prove that R is both symmetric and antisymmetric if and only if 
R C {(a, a) |a € A}. Equivalently, R is both symmetric and antisym- 
metric if and only if for all a, b € A, either a is not related to b or 
a=b. 

(b) Prove that if R is both symmetric and antisymmetric, then R is tran- 
sitive. 

(c) If R is both symmetric and antisymmetric, does it follow that R is 
reflexive? Explain. 


In the following exercises, we use the definition of the rational numbers Q as 
given in the Historical Comments at the end of this section. 


18. Let S = {(a, b) € Z X Z| b # O}. Let R be the relation on S defined 
by (a, b)R(c, a) if ad = bc. Prove that R is an equivalence relation on S. 


19. Prove that multiplication on Q is well defined in the same sense that we 
showed that addition on Q is well defined. 


20. (a) Prove that addition on Q is associative. 
(b) Prove that addition on Q is commutative. 


21. (a) Prove that multiplication on Q is associative. 
(b) Prove that multiplication on Q is commutative. 


22. (a) Prove that [0, x] = [0, y] forallx, yE Z x40, y 40. 


(b) Prove that [ax, ay] = [x, y] for allaax,yE ZaA0,y #0. 
(c) Prove that [x, x] = [1, 1] for all x € Z, x # 0. 
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23. (a) Prove that [0, 1] is the additive identity of Q. 
(b) Prove that every element of Q has an inverse with respect to addition. 


24. (a) Prove that [1, 1] is the multiplicative identity of Q. 
(b) Prove that every element of Q except [0, 1] has an inverse with 
respect to multiplication. 


25. Prove that the following distributive law holds in Q: [a, b]([x, y] + 
[z, w]) = [a b]lx y] + [e Olfz, w]). 


Discussion and Discovery Exercises 


D1. In Theorem 4.2.5, it is proved that different equivalence classes have 
empty intersection. Explain the logic of this proof using symbols P and 
Q to stand for statements in the proof. 


D2. Let R be the relation on Z defined by akRb if a’? + b? is even. Criticize 
the following ‘‘proof’’ that R is an equivalence relation on Z. 


aRa = a + a = 2a* = even. Therefore, Z is reflexive. 

aRb = a + b* = even = b’ + a’. Therefore, aRb and bRa. Z is symmetric. 
aRb and bRc = a + b? and b* + c? are even. 

a+ c = (a2 + b*) — (b? — c’) = even — even = even. Therefore, 
transitive. 


The following two exercises are for those who have completed an introductory linear 
algebra course. 


D3. What does it mean for two m X n matrices to be row equivalent? Prove 
that row equivalence defines an equivalence relation on the set of all 
m X n matrices. 


D4. What does it mean for two n X n matrices to be similar? Prove that 
similarity defines an equivalence relation on the set of all X n matrices. 


DS. In the remarks following the proof of Theorem 4.2.6, it is asserted that 
if A is a set, then there is a bijection between the set of all equivalance 
relations of A and the set of all partitions of A. Using these remarks 
as a guide, give a formal proof of this assertion by explicitly defining a 
function between these two sets and proving that it is a bijection. 


D6. The definition of a relation R on a set A does not rule out the possibility 
that R is the empty set. For example, if A is the set R of real numbers 
and we say that aRb if a < b and b <a, then R = ©. We say that R 
is the empty relation. Determine whether the empty relation is reflexive, 
symmetric, transitive, or antisymmetric. 
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AE INGAEGERS 


AXIOMS AND BASIC PROPERTIES 


In this chapter we study some of the most important properties of the set 
Z of integers. The study of the integers, called number theory, has held a 
fascination throughout history for both the professional and the amateur 
mathematician. Some of the most intricate and elegant mathematical ideas 
have developed as a result of attempts to solve difficult problems in number 
theory. We will discuss some of those ideas and problems and some of their 
history in this chapter. 


The Axioms of the Integers 


We will not construct the integers or give a formal definition. Our starting 
point will be the familiar properties of addition and multiplication in Z. 


The set Z of integers has two binary operations: addition, denoted by 
+, and multiplication, denoted by -, with the following properties: 


Al. Addition is associative: (x + y) + z =x + (y + 2) for all x, y, 
zEZ. 

A2. Addition is commutative: x + y = y + x for all x, y € Z. 

A3. Z has an identity with respect to addition, namely, the integer 0. 

A4. Every integer x in Z has an inverse with respect to addition, 
namely its negative —x. 
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AS. Multiplication is associative: (x - y) - z =x -(y-: 2) forall x, y, 
zEZ. 

A6. Multiplication is commutative: x - y = y : x for all x, y € Z. 

A7. Z has an identity with respect to multiplication, namely, the 
integer 1 and 1 # 0. 

A8. For all integers x, y, z, x: (y +z) =x-+y +x: Z. These are 
called the distributive laws. 


We do not prove these properties of Z; rather we take them as axioms: 
statements we assume to be true about the integers. We will list three more 
axioms for the integers later in this section. The last axiom, the Well-Ordering 
Principle, leads to the Principle of Induction, an important tool for proving 
theorems in all areas of mathematics. We will discuss induction in Section 5.2. 

Note that it follows from Proposition 4.1.4 that 0 is the only additive 
identity of Z and that 1 is the only multiplicative identity. 

Note also that no mention is made in these properties of inverses with 
respect to multiplication. The reason as we have noted, is that only 1 and 
—1 have multiplicative inverses in the set of integers. For a formal proof of 
this fact, see Corollary 5.1.8 later in the section. 

Notation: From now on, we will use the juxtaposition xy instead of x - y 
to denote multiplication of integers. 

We will also use the familiar notation x — y to mean x + (—y). 

Many of the familiar algebraic properties of the integers can be proven 
from the eight axioms just listed. We state and prove some of these properties 
in the next proposition. 


Let a, b, c € Z. 


Pl. Ifa + b=a-+c, then b=c. 

P2. ad = 0a = 0. 

P3. (—a)b = a(—b) = —(ab). 

P4, —(-a) = a. 
PROOF: P1. Suppose that a + b = a + c. Then, by first adding —a to both 
sides of this equation, we get the following sequence of steps: 


(—a) + (a + b) = (-a) + (a + ¢). 
(-a+a)+b=(-a+a)+c by the associative law Al. 
0+b=0+¢ by Ad. 

b=c by A3. 


P2. Applying A8 and the fact that 0 + 0 = 0, we get: 
a0 = a(0 + 0) = a0 + a. 


Proposition 5.1.2 


Proposition 5.1.3 
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Therefore, a0 + 0 = a0 + a0. 

Now we can apply P1 to conclude that a0 = 0. 

Since multiplication in Z is commutative, we also get 0a = 0. 

P3. We will prove that (—a)b = —(ab). (The proof that a(—b) = —(ab) 
is similar.) 

Notice that the left- and right-hand sides of this equation are conceptually 
different. The left-hand side is the additive inverse of a multiplied on the 
right by b. The right-hand side is the additive inverse of ab. So there is 
something to prove! 

Now ab + (—a)b = (a + (—a))b = 0b = 0,7 by P2. 

We conclude that (—a)b is the additive inverse of ab. Therefore, 
(—a)b = —(ab). 

The proof of P4 is left as an exercise. * * 


The following additional properties are corollaries of Proposition 5.1.1. 
The proofs are left as exercises. 


Let a, b,c € Z. 


P5. (—a)(—b) = ab. 
P6. a(b — c) = ab — ac. 
P7. (-1)a = —a. 

P8. (—1)(-1) = 1. 


As usual, we let Z* be the set of positive integers: Z* = {1, 2, 3, ...} 
where 2 = 1+ 1,3 = 2+ 1,..., and so on. In addition to the eight 
axioms of Z previously listed, we have three additional axioms that are really 
properties of Z*. The first two are: 


A9. Closure Property Z* is closed with respect to addition and 
multiplication: if x, y € Z*, then x + y € Z and xy € Z’*. 
A10. Trichotomy Law For every integer x, exactly one of the fol- 
lowing statements is true: x € Z*, —x © Z*, orx = 0. 


As we did with the first eight axioms, we can prove familiar properties 
of Z using Axioms 9 and 10. 


Ifx € Z,x # 0, then x? € Z. 


PROOF: By Axiom 10, either x € Z* or —x € Zt. If x € Z*, then x? = 
xx € Z* by closure of multiplication. On the other hand, if —x € Z*, then 
x? = xx = (—x)(—x) € Z* by closure of multiplication and P5. «» 
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Inequalities 


In the set Z of integers, there is a notion of one element being ‘‘smaller”’ 
than another. A formal definition of this notion is the following. 


Definition 5.1.4 Let x, y © Z. We say x < y (read: x is less than y) if 
youx& Zt. 


Note: If x < y, we can also write y > x (read: y is greater than x). We also write 
x=yifx<yorx = y. Similarly, y =xify >xory =x. 

Note also that the symbols < and < define relations on Z and that the symbol 
= defines a linear ordering on Z. 


It follows from Definition 5.1.4 that for every integer n, the statement 
n > 0 is equivalent to the statement n € Z* and the statement n < 0 is 
equivalent to the statement —n € Z*. Therefore, we can say: Z* = {n € 
Z|n> O}. 

The following well-known properties of inequalities are consequences of 
this definition as well as Axioms 9 and 10. 


Let a, b, c € Z. 


Ql. Exactly one of the following holds: a < b, b <a, ora = b. 
Q2. If a > 0, then —a < 0 andif a < 0, then —a > 0. 

Q3. Ifa > 0 and b> 0, thena + b>0O0andab > 0. 

Q4. If a > 0 and b < 0, then ab < 0. 

Q5. If a < 0 and b < 0, then ab > 0. 

Q6. Ifa < bandb<.« thena<c. 

Q7. Ifa <b, thena+c<bte. 

Q8. If a < b and c > O, then ac < be. 

Q9. If a < bandc < 0, then ac > be. 


PROOF: Q1. It follows immediately from Axiom 10 that exactly one of the 
following statements is true: 


b-aéeZ, -—(b-a)=a-bEZ, or b-a=0. 

Therefore, exactly one of the following holds: 

a<b, b<a, or a=b. 

Q2. If a > 0, then 0 — (—a) = a € Z’, implying that —a < 0. A similar 
proof shows that if a < 0, then —a > 0. 

Q3. This is just a restatement of Axiom 9. 

Q4. Suppose that a > 0 and b < 0. By Q2, —b > 0 and so by Q3, 
a(—b) > 0. But a(—b) = —(ab) by P3 so —(ab) > 0. 

Applying Q2 again gives —(—(ab)) < 0. But —(—(ab)) = ab, so we can 
conclude that ab < 0. 


Proposition 5.1.6 


Corollary 5.1.7 


Corollary 5.1.8 
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The proofs of Q5 through Q9 are left as exercises. © 


Note: Statements Q2 through Q9 of Proposition 5.1.5 remain true if < 
is replaced by = and > is replaced by =. 


The Well-Ordering Principle 


Our final axiom of Z is known as the Well-Ordering Principle and is another 
property of Z*. 


All. The Well-Ordering Principle Every nonempty subset of Z* 
has a smallest element; that is, if S is a nonempty subset 
of Z*, then there exists a € S such that a = x for allx € S. 


Despite its apparent simplicity, the Well-Ordering Principle is an impor- 
tant tool for proving properties of the integers. In the next section, we will 
see how the Well-Ordering Principle is used to establish the Principle of 
Mathematical Induction. 

We now prove what seems an obvious result but whose formal proof 
requires the Well-Ordering Principle. 


There is no integer x such that0 <x <1. 


PROOF: We will assume that the proposition is false and derive a contradic- 
tion. So suppose there is an integer x such that 0 < x <1. 

Let S = {n € Z* |0<n<1}.Nowx € S by our assumption, so S # ©. 
Thus, by the Well-Ordering Principle, S has a smallest element. Call it xp. 
Since x) < 1, x%) — 1 <0. 

Now by Q4, x(x) — 1) < 0, implying that xj — x) < 0 or x§ < x». Since 
Xo <1, it follows from Q6 that xj < 1. But x§ € Z* (Why?), which means 
that xf € S. 

Thus x is an element of S, which is smaller than x,—a contradiction to 
the choice of x, as the smallest element of S. Therefore our assumption that 
there is an integer x such that 0 < x < 1 must be false and the proposition 
follows, ¢» 


1 is the smallest element of Z*. 


PROOF: We leave it as an exercise. *)* 
We can now prove that only 1 and —1 have multiplicative inverses in Z. 


The only integers having multiplicative inverses in Z are +1. 
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PROOF: First note that the statement of Corollary 5.1.8 is a statement con- 
taining a universal quantifier and an equivalence. It can be restated as: 


For all a € Z, a has a multiplicative inverse in ZS a = +1. 


To prove the first implication, suppose a € Z has a multiplicative inverse 
in Z. Then there exists x € Z such that ax = 1. Clearly a # 0,soa € Z* 
or —a € Z*. Suppose a € Z* and a # 1. Then a > 1 by Corollary 5.1.7, and 
since ax = 1 € Z*, x € Z* also. (See Exercise 11(a).) Now since x # 0, 
x = 1. Multiplying both sides of the inequality a > 1 by x, we get 1 = 
ax > 1x = 1, a contradiction. Therefore a = 1. 

If —a € Z*, a similar proof shows that a = —1. It now follows that if a 
has a multiplicative inverse in Z, then a = +1. 

Conversely, if a = +1, then a has a multiplicative inverse in Z since 
(1)(1) = 1 and (—1)(-1) = 1. 


“» HISTORICAL COMMENTS: PEANO'S AXIOMS 


The axiomatic construction of the integers given in this section came fairly 
late in the history of mathematics, in the late 19th century in fact. Not all 
mathematicians thought that a logical development of the natural numbers 
(that is, positive integers) was possible. ‘God made the integers, all else is 
the work of man,” is the famous quote of Leopold Kronecker (1823-1891). 
Nonetheless, formal definitions of the integers were devised, most notably 
by Giuseppe Peano (1858-1932) in 1889. Peano listed five axioms, or postu- 
lates as they were called, of the natural numbers: 


P1: 1 is a natural number. 

P2: Every natural number has a successor. 

P3: 1 is not the successor of any natural number. 

P4: If the successors of the natural numbers n and m are equal, then 
so are n and m. 

P5: If S is a subset of the natural numbers with the two properties that 
(1) 1 © S and (2) whenever n € S, the successor of n is also in S, 
then S is the set of all natural numbers. 


The notion of successor is left undefined but intuitively we think of the 
successor of 1 as being the number 7 + 1. 

From these five postulates, Peano was able to prove, as theorems, all of 
the axioms of Z given in this section. Postulate 5, for example, is the Principle 
of Induction that we will discuss in the next section and is equivalent to the 
Well-Ordering Principle. Remarkably, with the set of natural numbers and 
Peano’s postulates as a starting point, one can construct the set of integers 
Z, the set of rational numbers Q (see the Historical Comments in Section 
4.2), and the set of real numbers R, without any additional axioms. We have 
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chosen not to adopt Peano’s approach here so that we may more quickly 
get to important algebraic properties of the integers. 


Exercises 5.1 


The following exercises, many of them having to do with very familiar proper- 
ties of the integers, should be done using only the 11 axioms of the integers 
or the results proved in this section. 


1. 


Complete the proofs of P4 through P8 by proving the following: 
(a) —(—a) = a for alla € Z. 

(b) (—a)(—b) = ab for all a, b € Z. 

(c) a(b — c) = ab — ac for all a, b, cE Z. 

(d) (—1)a = —a for alla € Z. 

(e) (~1)(-1) = 1. 


2. Let a, b € Z. Prove that —(a + b) = -a — b. 
3. Let a, b € Z. Suppose that a < b. Prove that —a > —b. 
4. (a) State the negation of Proposition 5.1.3. 


(b) State the converse of Proposition 5.1.3. Prove or disprove. 


. Complete the proof of Proposition 5.1.5 by proving the following: 


(a) If a < 0 and b < 0, then ab > 0. 
(b) Ifa < bandb<c, thena<c. 
(c) Ifa<b,thena+c<b+t+e. 
(d) If a< b andc > 0 then ac < be. 
(e) If a< b andc < 0, then ac > be. 


6. Let a, b, c, d € Z. Prove that ifa < b andc<d,thena+c<b+d. 


10. 


11. 


. Let a, b € Z. Suppose that ab > 0. Prove that either a > 0 and b > 0 


ora<Oandb<0. 


. Let a, b € Z. Prove the following converse of statement Q3 in Propo- 


sition 5.1.5: if a + b > 0 and ab > O, then a > 0 and b > 0. 


. Leta, b,c € Z. 


(a) Suppose that ac < bc and c > 0. Prove that a < b. 
(b) Suppose that ac < bc and c < 0. Prove that a > b. 


Consider the statement S: “If the sum of two integers is less than or 
equal to 100, then one of the numbers is less than or equal to 50.” 

(a) Rewrite S using variables and mathematical symbols. 

(b) State the contrapositive of S. 

(c) Prove or disprove S. 

(d) State the converse of S. Prove or disprove. 


Let a, b € Z. 


(a) Suppose that a € Z* and ab € Z’*. Prove that b € Z*. 
(b) Suppose that a < b and a € Z*. Prove that b € Z*. 
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12. 
13. 
14. 
15. 


16. 
17. 
18. 
19. 


20. 


21. 


22. 
23. 


24. 


25. 


Write the negation of Corollary 5.1.8. 
Prove that x? = x forall x € Z. 
Prove that if x € Z and x < 0, then x* < 0. 


Let P be the statement ‘For all x, y € Z, ifxy = 0, thenx = Oory = 0.” 
(a) Write the negation of P. 

(b) Write the contrapositive of P. 

(c) Prove or disprove P. 

(d) Write the converse of P. Prove or disprove. 


Let x, y, z © Z. Prove that if xy = xz and if x ~ 0, then y = z. 
Let n € Z. Prove that there is no integer x such thatn <x <n +1. 
Prove that there isno smallest element of Z and no largest element of Z. 


Let Z” be the set of negative integers. 

(a) Prove that —1 is the largest element of Z~. 

(b) Prove that every nonempty subset of Z~ has a largest element. 

(c) Prove that Z is closed in Z under addition but not closed under 
multiplication. 


Let my € Z and let T = {n € Z|n = ny}. Prove that every nonempty 
subset of T has a smallest element. 


(a) Write the negation of Axiom 9. 
(b) Write the negation of Axiom 10. 
(c) Write the negation of Axiom 11. 


Prove Corollary 5.1.7. 


Let S be a nonempty subset of Z*. Complete the following sentence: 
An element a is not the smallest element of Sif .... 


(a) Prove that the symbol < defines a relation on Z that is transitive 
but not reflexive and not symmetric. 
(b) Is < an antisymmetric relation? Prove your answer. 


(a) Prove that the symbol = defines a partial ordering on Z. 
(b) Prove that the symbol = defines a linear ordering on Z. 


Discussion and Discovery Exercises 


D1. Criticize the following ‘“‘proof’”’ of Exercise 5(c). 


If a < b, then b —- a > 0. Hence b —-a+c>c. Therefore, b + 
c >a+corin other words,at+c<bte. 


D2. What is wrong with the following “proof” of Corollary 5.1.8? 


Suppose a € Z has a multiplicative inverse in Z. Then there exists 
x € Z such that ax = 1. Soa = 1/x. But if x > 1, or if x < —1, then 
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1/x is not an integer, contradicting the fact that a is an integer. Therefore, 
x = +1. Since ax = 1, it follows that a = +1. 


D3. Let a and b be positive integers. State and prove a theorem that gives 
necessary and sufficient conditions on a and b for the existence of two 
real numbers whose sum is a and whose product is b. For example, 
there are real numbers whose sum is 13 and whose product is 36, but 
there are no real numbers whose sum is 12 and whose product is 40. 
Your answer should be in the form of a biconditional. 


D4. Let N be the set of natural numbers as defined by Peano with the axioms 
P1, P2, P3, P4, PS that were given at the end of this section. These 
axioms allow us to define a function f from N to N by sending each 
n € N to its successor; that is, if we let n, denote the successor of n, 
then f: N > N is defined by f(n) = n,. Note that axiom P2 guarantees 
that every n € N has a successor. In the following, you may use only 
Peano’s axioms. 

(a) Prove that 1 is not in the image of f. 
(b) Prove that f is injective. 

(c) Prove that Im f = N — {1}. 

(d) Prove that N is not a finite set. 

(e) Prove that ifn € N, then f(n) # n. 


INDUCTION 


Induction: A Method of Proof 


Induction is a method for proving statements about the positive integers. 
Let P(n) be such a statement. P(n) may be a formula, such as “‘the sum of 
the first n positive integers is n(n + 1)/2” or “‘the sum of the first m odd 
positive integers is a perfect square.’ Or P(n) may be a statement such as 
“every polynomial of degree n with real coefficients has at most n zeros.” 

The purpose of an induction proof is to show that a statement P(n) is 
true for every positive integer n. The first step is to verify that P(1) is true; 
that is, that the statement is true when n = 1. Once P(1) is established to 
be true, we want to verify P(2), P(3), P(4), and so on. Since there are an 
infinite number of these statements, they cannot all be verified separately. 
In an induction proof, we show that, given a positive integer k for which 
P(k) is true, it follows that the statement P(k + 1) is true. In other words, 
if the statement is true for some integer, then it is true for the following 
integer. This establishes that in the sequence of statements P(1), P(2), 
P(3),..., whenever one statement in this sequence is true, then the next 
statement in the sequence must also be true. 

Since we have already verified that P(1) is true, it now follows that the 


160 


Theorem 5.2.1 


EXAMPLE | 


CHAPTER 5 THE INTEGERS 


next statement P(2) is true. Once P(2) is known to be true then P(3) must 
also be true, then P(4), then P(5), and so on. 

Every P(n), for any given n, has now been verified in a finite number of 
steps. For example, P(500) is true because P(1) is true, which implies P(2) 
is true, which implies P(3) is true,..., which implies P(499) is true, which 
finally implies P(500) is true. 

Thus in an induction proof, there are two steps required: first, prove that 
P(1) is true; and second, assuming P(x) is true for a positive integer k, prove 
that P(k + 1) is true. 

What follows is a formal proof of the induction principle, called the First 
Principle of Mathematical Induction. 


First Principle of Mathematical Induction Let P(n) be a statement about 
the positive integer n. Suppose that 


1. P(A) is true. 
2. Whenever k is a positive integer for which P(k) is true, then 
P(k + 1) is also true. 


Then P(n) is true for every positive integer n. 


PROOF: Let S be the set of all positive integers for which P(n) is false. If 
we can show that S = ©, it will follow that P(n) is true for all positive 
integers n. We will assume that S # © and derive a contradiction. 

Since S # &, by the Well-Ordering Principle, S has a smallest element. 
Call it ko. 

Now 1 is not in S since P(1) is true, so ky > 1 by Corollary 5.1.7. Therefore 
ky — 1 is a positive integer and is not in S by the choice of ky as the smallest 
element of S. Therefore P(k) — 1) is true. But then by condition 2, 
P(k) — 1 + 1) = P(k,) is true. But P(ko) must be false since ky is in S. 

This gives us our contradiction and so S = ©. “2 


The assumption that P(k) is true in condition 2 is called the induction hy- 
pothesis. 


The formula for the sum of the first n positive integers 
_nn+1) 
2 
can be proven by induction. 
Given a positive integer n, let P(n) be the statement 
= n(n +1) 
ae. 


14+2+3+...+n 


14+2+3+...4+0 


Since the formula is obviously true for n = 1, P(1) is true. Therefore 
condition 1 of Theorem 5.2.1 holds. 
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Suppose now that k is a positive integer for which P(k) is true. Then 
k(k +1) 
k=. 
2 
(This is the induction hypothesis.) We now want to establish that P(k + 1) 
is true, which is equivalent to showing that 
(k + 1)(k + 2) 
gs 


Dee re Ane Gs fee es 


1+24+3+...¢k+(k+1)= 


Adding k + 1 to both sides of the induction hypothesis, we get 


k(k + 1) 
2 


14+24+3+...¢k+(k+1)= +(k +1) 


P+k 2k+2 
+ 
2 2 
SREB +2 
2 


_ (k + 1)(k + 2) 
aaa, ea 


Hence P(k + 1) is true. It now follows by induction that P(n) is true for 
every positive integer n. 


We will prove by induction that for every positive integer n, 


Le ee i ee BO SE SL 


Call this statement P(n). 
Since 1 + 2 = 2!*' — 1, we see that P(1) is true. 
Suppose now that k is a positive integer for which P(k) is true. Then 


TD OED ithe ee 


So 
1+2+ 2423+ ...4+ 2+ Qe = Qe — 1 4 get 
=2-21- 4 
= 2h2_— 4, 


which proves that P(k + 1) is true. Therefore, by induction, the formula is 
true for alln € Z*. 


Some Other Forms of Induction 


Sometimes a statement P(n) may be true not for all positive integers n but 
rather for all integers beyond a certain point, say for all integers n > 5 or 
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for all n = 0. A modified form of the Principle of Induction can be used to 
prove such statements. It goes as follows. 


First Principle of Mathematical Induction, Modified Form Let P() be a 
statement about the integer n. Suppose that there is an integer mp such that 


1. P(npo) is true. 
2. Whenever k = ny is an integer for which P(k) is true, then P(k + 1) 
is also true. 


Then P(n) is true for every integer n = no. 


PROOF: We leave it as an exercise. ** 


We will prove by induction that for every integer n = 3, n? > 2n + 1. 

Given a positive integer n, let P(n) be the statement n? > 2n + 1. It is 
clear that P(3) is true. 

Now suppose k = 3 is an integer such that P(k) is true; that is, k* > 
2k + 1. Then we want to prove that P(k + 1) is true; that is, we want to 
prove that (kK + 1) > 2(k +1) +1 = 2k + 3. 

But 


(A+1P =k? +2k+1 
>2k+1+2k+1 
=4k+2 
> 2k + 3. 


The first inequality comes from the induction hypothesis and the second, 
4k + 2 > 2k + 3, is clear since 2k > 1. Therefore, P(k + 1) is true and so 
by Theorem 5.2.2, P(n) is true for all n = 3. 


Sometimes, in order to do a proof by induction, it is necessary to modify 
the induction hypothesis and assume more than just that P(k) is true for a 
given k but rather that P(Z) is true for all positive integers i = k and then 
prove that P(k + 1) is true. This stronger induction hypothesis is still sufficient 
to prove that P(n) is true for all positive integers n. But of course it is 
necessary to prove another theorem to establish that fact. We state it next 
and leave the proof as an exercise as it is very similar to the proof of 
Theorem 5.2.1. 


Second Principle of Mathematical Induction Let P(n) be a statement about 
the positive integer n. Suppose that 


1. P(1) is true. 
2. If kK © Z* and P(i) is true for every positive integer i = k, then 
P(k + 1) is true. 


Then P(n) is true for every positive integer n. 
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Suppose that fis function from Z* to Z* such that f(1) = 1, f(2) = 5, and 
f(n + 1) =f(n) + 2f(n — 1) for all n = 2. This is an example of a recursively 
defined function. All values of f are uniquely determined by these equations 
but fis not defined by an explicit formula. Rather the value of f at a positive 
integer k is determined by its values at positive integers preceding k. 

Many functions that are defined recursively do have explicit formulas 
and induction is often the way to verify such a formula. In this example we 
will use the Second Principle of Induction to verify an explicit formula 
for f(n). 

Induction, however, does not tell us what this formula is. One can make 
an educated guess for f by plugging values of n into f and then trying to 
detect a pattern. 

Since f(3) = 7, f(4) = 17, f(5) = 31, and f(6) = 65, we see that f(7) is 
one more or one less than a power of 2 and a little trial and error will lead 
us to the formula 


fin) = 2 + (-1y. 


Let P(n) be the statement f(m) = 2” + (—1)". Since f(1) = 1 = 2! + 
(—1)! and f(2) = 5 = 2? + (-1)*, P(1) and P(2) are true. 

Next suppose that k is a positive integer greater than 2 such that P(i) is 
true for all i = 1, 2,...k. Then, in particular, P(k) and P(k — 1) are true. 
So we have 


f(k) = 2 + (-1)* and f(k — 1) = 242 + (-1)e, 


Hence 


f(k + 1) = fk) + 2f(k — 1) 
= 26 (=I) 20 eae) 
= oh (als 2+ (= 
= 2-2 + (-1)(1 — 2) 
= k+l + (-1)**!, 


This shows that P(k + 1) is true and therefore, by the Second Principle 
of Induction, P() is true for all positive integers n. 


One can state a modified form of the Second Principle of Induction 
similar to the modified form of the First Principle. We leave the statement 
as an exercise. 

The Principle of Induction is often stated in the language of sets. The 
theorem that follows is a restatement of the First Principle of Mathematical 
Induction in set theory language. The other forms of induction may be 
restated in a similar way. 
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Let S be a subset of Z*. Suppose that 


1L1eéES. 
2. If k is a positive integer for which k € S, thenk +1€E 5S. 


Then S = Z*. 


PROOF: We leave it as an exercise. @ 
As an example of Theorem 5.2.4, let’s revisit Example 1. 
Let 


s={nez 142434... 4 = MED 


2 


Then clearly 1 © S. Suppose now that k € S. Then 


14243+...4¢= SEED 


It follows that 


142434... 4k + (k+1)=MEED 


+ (k +1) 

eP+k 2k+2 
Es + 

2 2 
ah 3h 2 
2 
_ (k + I(k + 2) 
2 


proving that k + 1 © S. By TheoremS.2.4, S = Z*. We can now conclude that 


_ n(n +1) 


1424+34...4+n 5) 


for all positive integers n. Note that the algebraic content of this proof is 
the same as Example 1, so that there is no substantive difference in the 


two approaches. 


Often induction can be used to generalize the results of a theorem. Recall 
that in Theorem 2.2.3, we proved that if A, B, and C are sets, then A M 
(B UC) = (ANB) U (AN C). The following theorem is a generalization 


of that fact. 
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Theorem 5.2.5 Let A, B,, B,,..., B, be sets. Then A M(B, UB, U...UB,) =(AN 
By) (ABs) Oe UAB 


PROOF: The induction proof will be done with respect to the variable n. 
Let P(n) be the statement: ‘‘Whenever A, B,, B,,..., B, are sets, then AM 
(B, U B, a eee U B,) i (A M B,) U(A M By) ened (A M B,).” 

Clearly P(1) is true since it is merely an identity. 

Suppose now that k is a positive integer such that P(k) is true. Then A 
(B, UB,U...U By) =(ANB,) U(ANB)U...U(AN B,) whenever 
we have sets A, B,, B),..., By. This is the induction hypothesis. 

Now we wish to prove that P(k + 1) is true. So we let A, B,, B),..., 
B,, By be sets. Then 


AN (B,UB,U...U By U Bes) 
=AN((B,UBU...U By) U Best) 
=(AN(B,UB,U...U By)) U(AN Bury) 
=((ANB,) U(ANB)U...U(AN B,)) U(AN Best) 
=(ANB,)U(ANB)U...U(AN By) U (AN Baas). 


The second equality follows from Theorem 2.2.3, which is the case k = 
2, and the third one from the induction hypothesis. Since P(k + 1) is now 
proved, the proof of the theorem is complete. “* 


The Binomial Theorem 


An important application of mathematical induction is the Binomial 
Theorem. 


Definition 5.2.6 Letn € Z* andr € Zsuch that 0 =r Sn. The binomial 


! 
coefficient (") is defined as (") ds 


As usual, n! = (n)(n — 1)(n — 2)...2-lifn = 1 and 0! = 1. 
The binomial coefficient iC is the number of ways of choosing r objects 


from a collection of n objects. An equivalent interpretation is that it is the 
number of subsets of r elements contained in a set with n elements. 


Theorem 5.2.7. Leta, b € Z and letn € Z*. Then 


(a+by=> io arb. 


k=0 
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PROOF: We’ll assume a and b are given and that, for each n € Zt, P(n) is 


the statement: (a + b)" = > (7) a"-*bk. 


k=0 
It is easy to see that P(1) is true. Now suppose that n is a given positive 
integer and that P(n) is true. We will prove that P(n + 1) is true. So we are 


assuming that (a + b)" = > ”) a"-*b* (the induction hypothesis) and we 


k=0 
nt+1 
+1 
want to prove that (a + by"! = > : k 
k=0 


) qi tlkpe. 
(a + by"! = (a+ bya + by 


=a(a+b)"+ b(a+t by" 


39 ("\ 6% a id ee 
aS ("Ja 46S ("a b 


k=0 


= S a grt imkpk + s (") qr-kpkt! 
k=0 


k=0 


n n-1 
= pntl AY nti-kyk WY n-kyktl ntl 
ar S (ferro S (f)ertat som 


Now by a change of index, we can write 


we kena  ( n ) n+1-kpk 
5 (i) ¢ : =2 eae & 


k=0 


Hence we have 


(a + by"*! = at! + (") + : n )| grti-kpk 4 peti 
k=l = 


= qi} + y 4 : Z grti-kpk + prt! 


kal 
ntl n+ ') 

= qrti-kpk, 
ie 


Note that we have used the identity 


n n \_{nt+1 
(+ (,",)=( k ) forls=k<sn. 


This is a straightforward algebraic calculation. The details are left as 
an exercise. 


Corollary 5.2.8 
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This proves P(n + 1) and by induction it follows that P(7) is true for all 
positive integers n. 


5 ({)=2 


PROOF: Apply the Binomial Theorem with a = b = 1. @ 


Note: The Binomial Theorem is also true when a and b are real numbers. 
The reason is that the axioms of addition and multiplication that hold in 
Z (Axioms 1-8) also hold in R. These matters are discussed in detail in 
Chapter 7. 


MATHEMATICAL PERSPECTIVE: BERNOULLI NUMBERS 


We conclude this section with a discussion of the Bernoulli numbers, which 
were alluded to in the Introduction. The Bernoulli numbers are a sequence 
of rational numbers defined by Jakob Bernoulli (1654-1705). They have 
had some very interesting applications in mathematics over the last several 
hundred years. As background, consider the following formulas: 


14+2434...4(9-1)= MOD 
PEP evs. ys Very 
aie 7 
2 -1y 
Se ef pc 
342343 (n -1) j 


These formulas for sums of powers can be proven by induction. The first 
is an easy consequence of the formula in Example 1 and the other two 
are exercises. 

Can you see any pattern to these formulas? Nothing is immediately 
obvious, but note that the right-hand side of each equation is a polynomial 
expression in the variable n. The first is $n? — $n, the second gn? — $n’ + gn, 
and the third 4n* — n> + 4n’. Note that each of these polynomials has 0 as 
its constant term and that the highest power of n that appears is one more 
than the exponent on the left-hand side of the equation. In looking at other 
formulas of the form 1* + 2* + 3 + ...+ (m — 1)*, Bernoulli noted that 
the sum was a polynomial in n of degree k + 1 and 0 constant term. The 
coefficient of n in these polynomials takes on the values —3, §, 0, —35, 0 for 
k = 1, 2, 3, 4, 5. These are the first five Bernoulli numbers. Bernoulli was 
led to the following recursive definition. 
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Definition 5.2.9 The sequence of numbers Bo, B,, B2, .. a called Ber- 
noulli-numbers, are defined by B, = 1 and if Bo, B,, Bo, ... , B,-; are 


defined then 


Cee eeaspeere d 
Bone ||) 


Using this formula for t = 1, 2, 3, 4, we obtain the following equations: 


n= -4[e()a]-% 
n--i[()ee()aGal- 
roo) (a]=— 


Other Bernoulli numbers can be computed in similar fashion. It should 
be noted that if t is an odd positive integer >1, then B, = 0. This fact is not 
easy to prove from our definition. 

Using these numbers Bernoulli was able to give a formula for the sum 
of the first n kth powers. We state the result without proof. You are encour- 
aged to learn the proof on your own. One source is A Classical Introduction 
to Modern Number Theory by K. Ireland and M. Rosen [10]. 


Bernoulli If k is a positive integer, then 


1 &/k+1 
kK4 Ok 4 3k 4 00 4 (yn — 1) = —— nkt)-i, 
1*§+2'+3 (n — 1) 73 ( j ) Br 


The reader should verify the formula for k = 1, 2, 3 to see that it agrees 
with the formulas given. 

Another application of the Bernoulli numbers is to the Riemann zeta 
function, which was discussed in the Introduction. Recall that for a real 


number s > 1, s(s) = >. 1/n’. Euler proved that s(2) = 72/6 and in the 
n=] 


Introduction we gave his informal proof of that fact. Euler generalized that 
result to even integers and again Bernoulli numbers are involved. More 
specifically, he proved that if k is a positive integer, then 
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s(2k) = (-1)*" 


As an example, we see that the series >) 1/n* converges to 74/90. 


n=1 


Exercises 5.2 


1. Prove the following formulas using mathematical induction. 
(a) 1+345+...4+ Qn -1) =n? 
(b) 1724+ 224+ 37 4+ 4 yp = Nt On + 1) 
roe 6 : 
w(n+1) 


CO)? 2? AP 1 ea 4 


2. Prove the following: 
(a) 2+3274+54...4+(2n-1P = one Lice alee) 
(2n)(2n + 1)(2n + 2) 
S : 
3. Prove that if a is any real number except 1, then 


(a"*! — 1) 
a-1— 


(b) 27+ 4 +6 4+...+ (Qn = 


lt+tat+@ta@t+...¢a’= 


4. (a) Prove that 2" > n’ for all integers n = 5. 
(b) Prove that 2” < n! for all n = 4. 


5. Let a, bj, by,...,b, € Z. Prove that a(bj + by +... + b,) = ab, + ab, 
+... + aby. 


6. Let f: Z* — Z* be defined recursively by f(1) = 1 and f(n + 1) = 
f(n) + 2" for all n © Z*. Prove that f(n) = 2” — 1. 


7. Let f: Z* — R be defined recursively by f(1) = 1 and f(n + 1) = 
V2 + f(n) for all n € Z*. Prove that f(n) < 2 for alln € Z*. 


8. The Fibonnaci numbers /,, 7 = 1, 2,3, ..., are defined recursively by 
the formulas f, = 1,4 = 1,f, = fia t+ fi-2 for n = 3. 
(a) Write out the first ten Fibonnaci numbers. 
(b) Computefi +f, tAt+hAtAt+At Sh. 
(c) Derive a formula for the sum of the first n Fibonnaci numbers and 
prove it by induction. 
(d) Prove that f7 + fi+...+ 2 = fifrs for all n = 1. 


9. Let H,, be the number of handshakes required if in a group of n people 
each person shakes with every other person exactly once. 
(a) Compute H, for n = 2,..., 6. 
(b) Find a recursion formula for H,.; in terms of H,,. 


170 


CHAPTER 5 THE INTEGERS 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 
19. 
20. 


21. 
22. 


(c) Find an explicit formula for H,. 


Let A,, A2,..., A, be a collection of finite mutually disjoint sets. Prove 
that 


ae 


= » lA]. 


(This is Corollary 2.3.5 of Section 2.3. You may assume Theorem 2.3.4 
given in that section.) 


Let * be an associative binary operation on a set A with identity element 
e. Let B be a subset of A that is closed under *. Let b,, b),...b, € B. 
Prove that b; * b) *...* b, € B. 


Let * be an associative binary operation on a set A withidentity element 
e. In Exercise 32 of Section 4.1, you were asked to prove that if B and 
C are subsets of A that are closed under *, then B M C is closed under 
*, Prove by induction that if B,, B,,..., B, are subsets of A that are 
all closed under *, then B} N B, MN... B, is closed under *. 


(a) Let n be an integer. Prove by induction that if n is even, then n* is 
even for all k € Z*. 
(Note: the induction should be done with the variable k, not the variable n.) 


(b) State the converse of part (a). Prove or disprove. 


Prove by induction that if m,, m,...,n, are even integers, then n, + 
ny +... +n, is even. 

Let A, B,, B,,...,B, be sets. Generalize part 2 of Theorem 2.2.3 by 
proving that 


AU (BLN BN... B,) =(AUB)N(AUB)N...N(AUB,). 


Let A be a nonempty set. Let fi, 6, ..., fr © F(A). Prove: 

(a) If ff, fi,..., f, are surjective, then the composition f,f, ... f, is sur- 
jective. 

(b) Iffi,ft,...,f, are injective, then the composition f,f,. . . f, is injective. 

(c) If fi, fi, ..., f, are invertible, then the composition f\f, ... fh, is 


invertible and (fifp ...f:)7! = falfatr ... fyify'. 


Let P(n) be a statement about the positive integer n. Using the negations 
of conditions 1 and 2 in Theorem 5.2.1, complete the following sentence: 
P(n) is false for some positive integer nif... . 


Prove Theorem 5.2.2. 
Prove Theorem 5.2.3. 


State a modified form of the Second Principle of Induction similar to 
the modified form of the First Principle of Induction. 


Prove Theorem 5.2.4. 


Restate Theorem 5.2.3 in set theory language. 


5.2 


23. 


24. 


2): 


26. 


27. 
28. 
29. 


30. 


31. 
32. 
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Let A be a finite set with n elements. Use Corollary 5.2.8 to prove that 
P(A), the power set of A, has 2” elements. 


Prove that if m and k are integers such that 1 = k < n, then 
a ee cca 6 +1 
Mey Te 


Prove that if m and k are integers such that 1 = k <n, then (7) is 


an integer. 


Prove that >’ (-—1) (7) = 0. 
k=0 k 


Use the definition of Bernoulli numbers to compute B;, Bs, By, Bs. 
Verify Bernoulli’s Theorem for k = 1, 2, 3. 


Use Bernoulli’s Theorem to obtain formulas for 1* + 2 + 3+ ...4+ 
(n — 1)‘ for k = 4, 5, 6, 7. 


In this problem n and k are fixed positive integers. Let S,(m) = 1* + 
2 + 3k + ... + (n — 1). The purpose of this problem is to give a 
recursion formula for S,(n). This will give an alternative to Bernoulli’s 
Theorem for computing S;,(7). 

(a) Prove that if / is a positive integer, then 


cera (Ere (RE) p 


k+1 k+1 
=f eee me 
CS eee 
n-1 
(b) Prove that n**! = 5) [(1 + 1)! — UY], 
!=0 
(c) Use (a) and (b) to prove that 


ae 6 *) soy + (‘ £1) sum 


ree (i o ') Samy i: ; S,(n). 


(d) Use this recursion formula and not Bernoulli’s Theorem to com- 
pute S,(n). 


Use Euler’s Theorem to calculate s(s) for s = 6 ands = 8. 


This problem shows the connection between addition and multiplication 
in Z. Let t © Z. Use induction to prove thatmt=t+t+t+...+t 
(n times) for all n € Z*. 
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33. 


34. 


35. 


36. 


37 


38. 


39. 


Suppose that P is a subset of Z with the following properties: (1) P is 
closed with respect to addition and multiplication; and (2) for every 
element x of Z, exactly one of the following properties holds: x € P, 
x = 0, or —x € P. Prove that P = Z*. 

Hint: First prove that 1 © P.and then using induction, prove that Z* C P. Finally, 
prove that P C Z*. 

Let A be a set with n elements and B a set with m elements. Prove that 
A X B has nm elements. 

Hint: With m fixed, prove the result by induction with respect to n. 

Let A be a set with n elements. Let S(A) be the set of all permutations 
of A; that is, S(A) is the set of all bijective functions from A to A. Prove 
by induction that S(A) contains n! elements. 


Let f: Z — Z be a function with the property that f(x + y) = f(x) + 
f(y) for all x and y € Z. 

(a) Prove that f(0) = 0. 

(b) Prove that f(—x) = —f(x) for all x € Z. 

(c) Prove that f(x — y) = f(x) — f(y) for all x, y € Z. 

(d) Prove that if x,, %,..., x, © Z, then f(x, + x» +... +x) = 


f(x) + fa) +... + f(%n). 
(e) Prove that da € Z 3 f(n) = na for alln € Z. 


A polygon with n sides is called an n-gon. An n-gon is said to be convex 
if the straight line connecting any pair of interior points is contained in 
the interior. Assuming that the sum of the radian measures of the interior 
angles of a triangle is 7, prove by induction that the sum of the radian 
measures of the interior angles of a convex n-gon is 7(n — 2). 


Assuming the Sum Rule, 
© (FG) + 8(2)) = £0) + 8°). 
prove by induction that 
© (i) +A) + + FCO) = Fi) +f) +. + Fala) 


forn € Zt. 
Assuming that the indefinite integral is additive: 


[ (£0) + 80) de = | fe) de + [ g(x) at, 
prove that 
[ iG) +f) +... + AQ) dx = | fi) dx 
+ | f(x)de+...+ | fo dx 


forn € Z. 
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40. Assuming the Product Rule, 
© (f(x)8(x)) = flag’) +f C(2), 
prove by induction that 
£ (FOI) = (FOF) 


forn € Z*. 
41. Prove that for every real number x = 0 and every positive integer n, 


2 3 n 
x xX x 
— +4 ..4 


eee ae aly aye af 


42. Assuming that (1 + 1) < e, for alln € Z*, prove by induction that n! > 


Hint: This problem can also be done without induction by using the previous 
problem. 


Discussion and Discovery Exercises 


D1. Induction proofs are often unsatisfactory in the sense that they do not 
tell us why something is true. Give a proof of the formula 


_ n(n + 1) 


14+2+3+...+n ) 


(see Example 1 of this section) that is not an induction proof. Explain 
why this may be a better proof than the induction one. 

D2. The numbers1,1+2,1+2+3,1+2+3+4,...are called triangular 
numbers because they can be arranged in triangular form: 


e 
e e e 
e e t e e e 
e e e e e e e e e e 
1 1+2 1+2+3 14+2+3+4 


If we let 7,, denote the nth triangular number, then 7; = 1, 7; = 3, 
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D3. 


D4. 


DS. 


T; = 6, T, = 10, and so on. In Example 1 of this section, we proved by 
induction a formula for the nth triangular number; namely, 


_ n(n +1) 


T, 5 


(a) Prove that the sum of two consecutive triangular numbers is a 
perfect square. 

(b) Let T, be the nth triangular number. Prove that 87, + 1 is a per- 
fect square. 

(c) Prove the following converse of part (b): If tis an integer and 8¢ + 
1 is a perfect square, then ¢ is a triangular number. 

(d) Prove that the difference of the squares of two consecutive triangu- 
lar numbers is a cube. 


(e) Prove that the infinite series > rT converges to 2. 


n=l 4n 

(f) Find 2 triangular numbers that are also squares. 

(g) Prove that if there exist integers x and y such that x? — 8y? = 1, 
then y’ is a number that is both a square and a triangular number. 

(h) Leta =3 + V8. It can be proved, although it is not easy to do 
so, that if one takes powers of a and writes them in the form 
x+ yv38, then y’ is both a square and a triangular number. Use 
this result to find four numbers that are both square and triangular. 
Show that the numbers you find are indeed triangular. 


Similar to the triangular numbers are the numbers 1, 1 + 3,1 + 3 + 

5,1 +3 +5 + 7,... and so on. These numbers are called square 

numbers because they are all perfect squares. In fact, if S,, denotes the 

nth square number, then S, = n?. (See Exercise 1(a).) 

(a) Draw a diagram that justifies the word square for this sequence 
of numbers. 

(b) Draw a picture that illustrates the fact that if n > 1, then the nth 
square number is the sum of two consecutive triangular numbers. 


Now consider the sequence of numbers 1,1 + 4,1+4+7,1+ 4+ 

7+ 10,..., and so on. These numbers are called pentagonal numbers. 

(a) Draw a diagram that justifies the word pentagonal for this sequence 
of numbers. 

(b) If P,, denotes the mth pentagonal number, find a formula for P,, and 
use induction to prove it. 

(c) Define a sequence of numbers that goes the next step beyond pen- 
tagonal numbers and that might be called hexagonal numbers. Can 
you draw a picture to justify the name hexagonal? Find and prove 
a formula for these numbers. 


Consider the sequence of numbers: 1,3 + 5,7 +94 11, 13 + 15 + 
17 + 19,....Is there a pattern to this sequence that leads to a formula? 
If so, find the formula and prove it. 
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THE DIVISION ALGORITHM AND GREATEST 
CommMON Dyivisors 


As we all learn in grade school arithmetic, dividing an integer by a positive 
integer gives an integer plus a remainder that is positive or zero and is less 
than the original divisor. For example, ¢ = 3 + 2, so the remainder is 2. The 
following theorem, known as the Division Algorithm, is a formal statement 
of this fact. 


Division Algorithm Let a, b be integers with b > 0. Then there exist unique 
integers gq and r such that a = bg + rwhereO =r< b. 


PROOF: Let S = {n € Z|n =a — bx, for some x € Z} and let S) = {n € 
S | n = O}; that is, Sp is the set of nonnegative elements of S. 

We claim that S) is nonempty. Note that a € S since a = a — 5: 0. So 
ifa = 0,a € Sy. If a < 0, then a — ba € S. But a — ba = a(1 — b) = 0 
since a <0 and 1 — b= 0. Hence a — ba € Sj. In any case, Sp is nonempty. 

Now let r be the smallest element of Sy. If 0 € Sy, then r = 0. If 0 € Sp, 
then r exists by the Well-Ordering Principle. 

We have r = a — bq for some g € Z and r = 0. Suppose that r = b. 
Then r — b =a — b(q + 1) € Sp. But this contradicts the fact that r is the 
least element of Sy. Hence 0 =r < b. 

To prove uniqueness, suppose there also exist g; and 7 in Z such that 
a = bq, + r; where 0 = r, < b. Without loss of generality, we can assume 
thatr=7,. 

Since a = bq + r, we have 


bq -q=r-n=0. 


Therefore, gq, — g = 0. If gq; — gq # 0, then r — r, = b, a contradiction 
since 0 =r < band0<v7, < b. It follows immediately that q, = g and 
n= Pr. ae 


Divisors and Greatest Common Divisors 


When the remainder in the Division Algorithm is zero, we say that a is 
divisible by b or b divides a. More formally, we have: 


Definition 5.3.2 Let a, b be integers. We say b divides a, written b | a, 
if there is an integer c such that bc = a. We say that b and c are factors 
of a or that a is divisible by b and c. : 
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—4 divides 12, since (—4)(—3) = 12. 
3 does not divide 7, since 7 = (3)(2) + 1. 


We prove by induction that 3 divides 4" —1 for every positive integer n. 

Let P(n) be the statement ‘3 divides 4” — 1.” Because 4 — 1 = 3 and 3 
| 3, we have that P(1) is true. 

Suppose that k is a positive integer for which P(k) is true. Then 3 divides 
4 — 1. Therefore 4 — 1 = 3 for some t € Z. Hence 4**! — 
1=4-4 —-1= 4(4* —- 1) + 3 = 430) + 3 = 3(4t + 1). This proves that 
3 divides 4**! — 1 and so P(k + 1) is true. 

Therefore P(n) is true for all positive integers n. 


Leta, b,c € Z. 


1. Ifa|1thena = +1. 

2. Ifa|b and b|a, thena = +b. 

3. Ifa |b anda|c, thena| bx + cy for any x, y € Z. 
4. Ifa|band b|c, thena|c. 


PROOF: 1. If a | 1, then there exists x € Z such that ax = 1. Hence a has a 
multiplicative inverse in Z. Thus by Corollary 5.1.8, a = +1. 
The proofs of 2, 3, and 4 are left as exercises. “ 


Given a pair of integers a and b, we can consider all integers that divide 
both a and 5, called common divisors of a and b. For example, if a = 12 
and b = 18, then +1, +2, +3, +6 are all the common divisors of a and b. 
The largest of these is 6 and is divisible by all of the others. 


Definition 5.3.4 Let a, b be integers, not both zero. A positive integer 
d is called a greatest common divisor of a and b if 


1. d divides a and d divides b; that is, d is a common divisor of a 
and b 
2. whenever an integer c divides both a and b, then c divides d. 


The next theorem shows that every pair of integers a and b (not both 
zero) has a greatest common divisor and that it is unique. We will then show 
how the Division Algorithm can be used to compute the greatest common 
divisor of a pair of integers. 


Let a, b € Z, not both zero. Then a greatest common divisor d of a and b 
exists and is unique. Moreover, there exist integers x and y such that d = 
ax + by. 


Lemma 5.3.6 
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PROOF: Let S = {n € Z|n = ax + by, for some x, y € Z}. S is a subset of 
Z that contains both a and b sincea =a-:1+b-Oandb=a-0+ 5-1. 

Similarly —a and —b are in S, so S must contain positive integers. By 
the Well-Ordering Principle, S has a smallest positive element. Call it d. We 
now show that d is a greatest common divisor of a and b. 

First, notice that d € S, so there exist integers x and y such that d = 
ax + by. This proves the second statement of the theorem. Next, applying 
the Division Algorithm to a and d, there exist g, r € Z such that a = dq + 
rwhereO=r<d. 

But 


r=a—dq=a-— (ax + by)q = a(1 — xq) + b(-yq), 


which implies that r € S. 

Since d is the least positive integer in S, we must have r = 0. Therefore 
d | a. Similarly, we can show that d | b. 

Finally, suppose that c is a common divisor of a and b. Then there exist 
integers u and v such that cu = a and cv = b. Therefore, d = ax + by = 
c(ux + vy), and we have that c | d. This proves that d is a greatest common 
divisor of a and b. 

We leave the proof of uniqueness as an exercise. «7? 


Since a greatest common divisor (abbreviated g.c.d.) of two integers a 
and b, not both zero, exists and is unique by the theorem just proved, we 
will refer to it as the greatest common divisor of a and b, and we will denote 
it by (a, b). In context, there should be no confusion between this notation 
and the notation for ordered pairs of real numbers. 


Euclidean Algorithm 


We now give a procedure, using the Division Algorithm, for computing the 
g.c.d. of two integers. The following lemma will be helpful. We leave the 
proof as an exercise. 


Let a, b € Z, not both zero. Suppose that there exist integers gq and r such 
that a = bq + r. Then (a, b) = (0, r). 


Let a, b be two positive integers with a > b. By repeated application of 
the Division Algorithm, we get 


a=bq,+n, where q,,7,€ ZO0517,<b, 
b=ngatn, where 2,1 € Z,0O=n<n, 
nN =Nng3 t's, where @3,5€Z,05743<n, 
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Because each remainder is strictly less than the previous one, we eventu- 
ally get 


Fn-2 = Vn-1Qn + Vay where qn, ln © Z, OS 0 n < Mn, 
Vn-1 = VnGn+1 + Yast, where qg,,,;€Z and 7,4; =0. 


By Lemma 5.3.6, we have 
(a, b) = (6, n) = (1, 1) = (rn, r3) ioe ta tie (Tn-15 Tn) = ns 0) =T?n- 


Thus the g.c.d. of a and b is the last nonzero remainder that occurs in this 
procedure. (If r; = 0, then (a, b) = b). This procedure for finding the g.c.d. 
of two integers is called the Euclidean Algorithm. 


Let a = 9180 and b = 1122. 


9180 = (1122)(8) + 204 
1122 = (204)(5) + 102 
204 = (102)(2) + 0. 


Therefore, (9180, 1122) = 102. 


The previous procedure can also be used to find integers x and y such that 
(a, b) = ax + by. 
Using Example 4, we get 


102 = 1122 + 204(-5) 
= 1122 + (9180 + 1122(—8))(—5) 
= 1122 + 9180(—5) + 1122(40) 
= 1122(41) + 9180(-5). 


Hence x = —5 and y = 41. 


Relatively Prime Integers 


Often a pair of integers will have no common divisors except for the obvious 
ones of 1 and —1. 


Definition 5.3.7 Two integers a and b, not both zero, are said to be 
relatively prime if (a, b) = 1. 


For example, 12 and 35 are relatively prime, as are 14 and 15. 
The following result is a useful characterization of relatively prime 
integers. 


Theorem 5.3.8 


Theorem 5.3.9 


Theorem 5.3.10 
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Let a, b € Z. Then a and Db are relatively prime if and only if there exist 
integers x and y such that ax + by = 1. 


PROOF: Suppose a and D are relatively prime. Then by Theorem 5.3.5, 1 = 
(a, b) = ax + by for some x, y € Z. 

Conversely, suppose there exist x, y © Z such that ax + by = 1. Let 
d = (a, b). Since d divides both a and b, then by Proposition 5.3.3, part 3, 
d also divides ax + by. 

Therefore d | 1, and by Proposition 5.3.3, part 1, d = 1. @® 


It is possible for an integer a to divide the product of two integers without 
a dividing either of the factors. For example, 6 | 4 - 3 but 6 does not divide 
either 4 or 3. However, if a is relatively prime to one of the factors, then it 
must divide the other factor. A proof of this fact follows and is our first 
important application of Theorem 5.3.8. 


Let a, b, c € Z. Suppose a | bc and (a, b) = 1. Then a | c. 


PROOF: Since a | bc, there is a z in Z such that az = bc. Also, by Theorem 
5.3.8, there are x, y in Z such that ax + by = 1. Then c = c(ax + by) = 
cax + cby = cax + azy = a(cx + zy). Hence a|c. @ 


@ HISTORICAL COMMENTS: PYTHAGOREAN TRIPLES 


The Pythagorean Theorem gives rise to the equation x’ + y? = z’. Integer 
solutions of this equation are called Pythagorean triples. It is not hard to 
show that if (x, y, z) is a Pythagorean triple, then any multiple (¢x, ty, tz) 
where fis an integer is also a Pythagorean triple. If (x, y, z) is a Pythagorean 
triple and the integers x, y, and z have no common factor, we say that (x, 
y, Z) is a primitive Pythagorean triple. Some examples are: (3, 4, 5), (5, 12, 
13), (8, 15, 17), and (20, 21, 29). The following theorem, due to the Greek 
mathematician Diophantus, characterizes the primitive Pythagorean triples. 


Let x, y, and z be positive integers. Then (x, y, z) is a primitive Pythagorean 
triple if and only if there exist relatively prime positive integers a and b, 
a > b, and a and b having opposite parity, (meaning one of them is even 
and one odd) such that x = 2ab, y= @ -— b’,z=a@ + b?. & 


By taking a = 3 and b = 2, for example, we get the triple (12, 5, 13). 

Diophantus, who lived in the third century 4.p., is famous for having 
written Arithmetica, the earliest known treatise on algebra. It consists of 
thirteen books, only six of which have survived. Diophantus was interested 
in methods for finding integer and rational solutions of equations.and Arith- 
metica is filled with examples and techniques for solving such equations, 
which have become known as Diophantine equations. 
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Exercises 5.3 


. (a) Prove that every integer 7 is of the form 34, 3t + 1 or 3t + 2 for 


some ¢ € Z. 
(b) Prove that every integer n is of the form 41, 4¢ + 1, 4t + 2 or 4t + 
3 for some ¢ € Z. 


. Prove that n? — n is divisible by 2 for every integer n. 


3. Prove that ifn € Z, then n? + 2 is not divisible by 4. 


11. 


12. 
13. 


14. 


15. 
16. 


. Prove by induction that for every positive integer n: 


(a) 4 divides 5" — 1. 
(b) 7 divides 2" — 1. 


. Generalize Problem 4 by proving that if a € Z, then for every positive 


integer n, a — 1 divides a" — 1. 


. (a) Prove part 2 of Proposition 5.3.3. 


(b) Prove part 3 of Proposition 5.3.3. 

(c) Prove part 4 of Proposition 5.3.3. 

(d) State the contrapositive of each part of Proposition 5.3.3. 

(e) State the converse of each part of Proposition 5.3.3. Prove or disprove 
each converse. 


. Leta, b € Z, not both zero. Complete the proof of Theorem 5.3.5 by 


showing that a g.c.d. of a and b is unique. 


. Let a, b € Z, not both zero. Let S = {n € Z|n = ax + by, for some 


x, y € Z}, the set defined in the proof of Theorem 5.3.5. Let d = (a, b). 
Prove that S is the set of all integer multiples of d. 


. Prove Lemma 5.3.6. 
10. 


Compute the g.c.d.s of the following pairs of integers. Then find integers 
x and y such that (a, b) = ax + by. 

(a) a = 1840, b = 1518 

(b) a = 1001, b = 3465 

(c) a = 255, b = —143 


Let a, b € Z, not both zero, and let d = (a, b). Prove that (4, 2) = 


Let a, b, c € Z. Suppose that (a, c) = (b, c) = 1. Prove that (ab, c) = 1. 


Let a, b, c € Z. Suppose that a | c and b | c and that (a, b) = 1. Prove 
that ab | c. 


Let n, m € Z such that (n, m) = 1. Prove that nZ 1 mZ = nmZ. 
Recall that nZ is the set of all integer multiples of n. 


Prove that for every integer n, (n,n + 1) = 1. 


(a) Prove that if n is an odd integer, then (n, n + 2) = 1. 
(b) If 1 is an even integer, what is (n, n + 2)? Prove your answer. 


5.3 


17. 


18. 


19. 


20. 


21. 


22. 


23. 
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(a) State the contrapositive of Theorem 5.3.9. 
(b) State the converse of Theorem 5.3.9. Prove or disprove. 


Prove that if (x, y, z) is a Pythagorean triple, then for any integer 4, (tx, 
ty, tz) is a Pythagorean triple. 


Prove that if (x, y, z) is a Pythagorean triple, then there exists integers 
d, u, v, w such that x = du, y = dv, and z = dw where (u, v, w) is a 
primitive Pythagorean triple. 


Prove that if (x, y, z) is a primitive Pythagorean triple, then x and y have 
opposite parity. 

Prove that if (x, y, z) is a primitive Pythagorean triple, then x or y is 
divisible by 3. 

Prove that if (x, y, z) is a primitive Pythagorean triple, then x, y or z is 
divisible by 5. 

Let C be the unit circle given by the equation x* + y’ = 1. A rational 
point on C is a point (a, b) where a and 5b are rational numbers. For 


example, (1, 0), (0, 1), and (3/5, 4/5) are rational points on C. Prove that 
there are infinitely many rational points on C. 


Discussion and Discovery Exercises 


D1. 


D2. 


D3. 


D4. 


DS. 


Explain why the restriction that a and b be not both zeroin the statement 
of Theorem 5.3.5 is a necessary condition for that theorem to be true. 
Also, show where the restriction is actually used in the proof. 


Theorem 5.3.5 is an example of what is known as an existence theorem. 

It asserts the existence of the greatest common divisor of two integers. 

(a) What is the key step in the proof of Theorem 5.3.5 that guarantees 
the existence of the greatest common divisor? 

(b) Have we encountered any other existence proofs in this chapter? 
If so, what are they? 


Let a, b € Z, not both zero, and suppose there exist integers x and y 
such that ax + by = 2. Does this fact imply that (a, b) = 2? If not, 
what can be said about (a, b)? 


In Exercises 15 and 16, we looked at the greatest common divisors (n, 

n + 1) and (n, n + 2) for all integers n. 

(a) Formulate a similar result that characterizes (n, n + 3) for all 
integers n. 

(b) Generalize part (a) so that for a fixed positive integer k, you can 
characterize (n, n + k) for all integers n. 


Pythagorean triples give examples of the sum of two squares being itself 
a square. Can you find positive integers x and y such that x? + y’ isa 
cube? Does it seem likely that there are infinitely many triples (x, y, z) 
such that x? + y? = z3? 
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PRIMES AND UNIQUE FACTORIZATION 


Prime Numbers 


Every positive integer greater than 1 has at least two positive divisors: itself 
and 1. Some positive integers greater than 1 have the property that these 
are the only positive divisors. The integers 2, 3, 5, and 7 have this property 
but 4 and 6 do not; 6, for example, is divisible by 2 and 3. In order to study 
the set of integers in any depth, it is necessary to discuss the numbers with 
this property. 


Definition 5.4.1 An integer p greater than 1 is called a prime number 
if the only divisors of p are +1 and +p. If an integer greater than 1 is 
not prime, it is called composite. 


Prime numbers are the building blocks of the integers since every positive 
integer can be written as a product of primes in an essentially unique way. 
This important fact, called the Unique Factorization Theorem, will be proven 
in this section. 

The following characterization of composite numbers we leave as an ex- 
ercise. 


Let n € Z*,n > 1. Then x is composite if and only if there exist integers a 
and b such that n = ab where 1 <a<nand1l1<b<n. 


Although not every integer is prime, itis true that every integer is divisible 
by a prime number. The proof, which is a nice application of the Well- 
Ordering Principle, is given next. 


Every integer greater than 1 is divisible by a prime number. 


PROOF: Let T = {n € Z* | n> 1 and nis not divisible by a prime number}. 
It suffices to prove that T = &. Suppose, on the contrary, that T # ©. Then 
by the Well-Ordering Principle, T has a smallest element. Call it 1. 

Now ny itself cannot be prime, otherwise it would be divisible by a prime, 
namely itself. So by Lemma 5.4.2, there exist integers a and b such that 
Ny = ab where 1 <a<nand1<b<n. 

Since a is smaller than ny, a is not in T. Hence a is divisible by some 
prime p. Since p divides a and clearly a divides ny, by Proposition 5.3.3, p 
divides ny. This is a contradiction to the fact that m) is in T. Thus T = ©. 


We saw at the end of the last section that an integer x could divide the 
product of two numbers a and b without dividing either a or b. This will not 
happen, however, if x is prime, as we see in the next proposition. 


Proposition 5.4.4 


Corollary 5.4.5 


Corollary 5.4.6 


EXAMPLE | 


Theorem 5.4.7 
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If a prime number p divides a product ab, where a, b € Z, then p divides 
a or p divides b. 


PROOF: Suppose p does not divide a. Then a # 0 and so (p, a) = 1, since 
the only positive divisors of p are 1 and p. It now follows from Theorem 
5.3.9 that p divides b. @» 


The following two corollaries should be proved by induction. 


Let p be a prime and let a, m,... , a, © Z. If p divides the product 
A+ Q)*...* Am, then p divides a; for some i = 1, 2,..., m. 


PROOF: We leave it as an exercise. =? 


Let p be a prime and let a € Z. If p divides a” for some positive integer m, 
then p divides a. 


PROOF: We leave it as an Exercise. © * 


We will use Proposition 5.4.4 to prove that V2 is not a rational number. 
Suppose, on the contrary, that V2 is rational. Then we can write V2 = 
alb where a, b € Z and a/b is reduced to lowest terms. (Recall that being 
reduced to lowest terms is equivalent to saying that a and b have no common 
factors or in other words that a and b are relatively prime.) Hence a? = 2b’, 
which implies that 2 divides a’. It follows from Proposition 5.4.4 that 2 divides 
a. So a = 2c, for some c in Z. We get a = 4c? = 26° or 2c? = b’, which 
implies, again by Proposition 5.4.4, that 2 divides b. This is a contradiction 
to the fact that a and b have no common factors. 


Unique Factorization 


We are now in a position to prove one of the most important theorems 
about the integers, the Unique Factorization Theorem. The theorem has 
two parts: an existence part, which says that every integer greater than 1 is 
a product of primes, and a uniqueness part, which states that this product 
is unique except for the order in which the primes occur. We will use the 
Second Principle of Mathematical Induction to prove the existence part and 
rely heavily on Corollary 5.4.5 to prove the uniqueness part. 


Unique Factorization Theorem Let n € Z, n > 1. Then n is a prime number 
or can be written as a product of prime numbers. Moreover, the product is 
unique, except for the order in which the factors appear. 


PROOF: To prove the existence part of the theorem, namely, that every 
integer greater than 1 is a prime or a product of primes, we use the Second 
Principle of Mathematical Induction. (See Theorem 5.2.3.) 
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Let P(n) be the statement: m = 1, n is prime or n is a product of primes. 
Trivially, P(1) is true. 

Suppose that k € Z* and that P(i) is true for allisuchthat1 =i=k. 
We will show that P(k + 1) is true. 

If k = 1, then P(k + 1) = P(2) is true since 2 is prime. 

Now suppose that k > 1. The induction hypothesis tells us that every 
integer i such that 1 <i = k is prime or a product of primes. If k + 1 is 
prime, then clearly P(k + 1) is true. If k + 1 is not prime, then by Lemma 
5.4.2,k + 1= ab, wherel <a<k+1land1<b<k+1. 

By the induction hypothesis, both a and b are primes or products of 
primes. Therefore k + 1 = ab is a product of primes, again implying P(A + 
1) is true. It now follows by induction that P(n) is true for all integers n = 
1. Therefore, every integer n > 1 is a prime or a product of primes. 

To prove uniqueness, suppose that nm = p,- po: ...° ps and that n 
N° 4Qz° +++" Gy Where pi, Pr, --+ 5 Dss Nis G2, -++ 5 Gr are primes. 

We will show that s = tand that after suitable renumbering, p; = q; for 
alli = 1, 2,..., 5. We can assume, without loss of generality, that s = ¢. 

Since p, divides n, p, divides gq: + q.+...+ q:. By Corollary 5.4.5, p; divides 
q; for some j. We now renumber the gqjs so that q; = q. 

Because p, and q, are primes, we must have p, = ee It follows from 


Exercise 16 of Section 5.1 that p) - p3-...° Ds = qo Qt. 

Now p, divides q2 + q3°...° G;, SO again aby Garon . 4, 5, P2 divides q; 
for some j. Letting g; = q., we have p. = q. 

Continuing this process, after s steps, we get pj = q;fori=1,2,...,5. 
If s < t, we would have the equation 1 = qy4; * Gsin* --- * G;, a contradiction. 


Therefore s = ¢ and p; = q; for all i. This completes the proof of the 
theorem. @» 


The integer n = 22540 = 2-2-5-7-7- 23 and any factorization of this 
number must be the same except for a rearrangement of the factors. 


Let n € Z*,n > 1. In writing n as a product of primes, if we group 
together the primes that appear more than once, then we can write n in 
the form 


mn. 


n= pt: pyr... ph 


where py, p2,..., Pp, are distinct primes, p) <p) <...< pm, and the expo- 
nents m,, m,, ... m, are positive integers. We say that n is written in 
standard form. 


From the previous example, we can write 


22540 = 2?-5- 7+ 23. 


Theorem 5.4.8 


Theorem 5.4.9 
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MATHEMATICAL PERSPECTIVE: EUCLID'S THEOREM AND SOME 
GENERALIZATIONS 


There are many interesting and challenging problems involving prime num- 
bers. For example, we show here that there are infinitely many prime num- 
bers. The proof is essentially the same as the one given by Euclid in the 
third century B.c. 


Euclid’s Theorem There are infinitely many prime numbers. 


PROOF: Suppose that, on the contrary, there are only finitely many 


primes pi, P2,..., p,. Let m = p, + py: ... + p, + 1. Let p be a prime 
that divides m. (Why must such a prime exist?) Because p = p; for some 
i= 1,2,...,~7, p also divides p, - p.: ...- p,. Therefore, p divides 
m— Pp,‘ pr: ...* p, = 1, acontradiction. Thus there must in fact be infinitely 


many primes. ©» 


There are many generalizations of Euclid’s Theorem. For example, it is 
known that there are infinitely many primes in the set {1, 5, 9, 13, 17, ...}. 
More generally, a famous theorem due to Peter Gustav Lejeune Dirichlet 
(1805-1859) states that there are infinitely many primes in any arithmetic 
progression of the form {an + b | n = 1, 2, 3, ...} where a, b © Z* and 
(a, b) = 1. The set {1, 5, 9, 13, 17, ...} is an example with a = 4 and 
b = 1. Although the proof of Dirichlet’s Theorem is beyond the scope of 
this book, a special case is proven next. 


There are infinitely many primes of the form 4n + 3. 


PROOF: Suppose that, on the contrary, there are only finitely many primes 
Pi, P2,---, Pp, of the form 4n + 3. Since 3 is such a prime, we will assume 
that p, = 3. Let N = 4p)p;... p, + 3. Since N is of the form 4n + 3, there 
must be at least one prime divisor of N of the form 4n + 3. (See Exer- 
cise 18 following.) Call this prime p. If p = 3, then we would have that 
3 | 4p.p; ... p,, implying that 3 | p; for some i, a contradiction. On the 
other hand, if p # 3, then p = p; for some i since p;, po, ... , Pr are, by 
assumption, all of the primes of the form 4n +3. But since p | N and 
p | 4p.p3... p,, we get that p | 3, again a contradiction. Therefore, there 
must in fact be infinitely many primes of the form 4n + 3. == 


Some deceptively simple problems concerning primes remain unsolved, 
even after centuries of effort by mathematicians. For example, two primes 
that differ by 2 are called twin primes: 3 and 5, 5 and 7, 11 and 13, and 17 
and 19 are such pairs. Remarkably, it is unknown whether there are infinitely 
many pairs of twin primes. 

Another unsolved problem involving primes is the assertion, known as 
the Goldbach Conjecture, that every even integer greater than 4 is the sum 
of two odd prime numbers. This conjecture has been verified for all even 
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integers less than 10". So although just about everyone believes that it is 
true, no general proof seems forthcoming in the immediate future. 


Exercises 5.4 


. Prove Lemma 5.4.2. 


. Letn € Z,n > 1. Prove that if 1 is not divisible by any prime number 


less than or equal to Vn, then n is a prime number. 


. Let n be a positive integer greater than 1 with the property that whenever 


n divides a product ab where a, b € Z, then n divides a or n divides b. 
Prove that n is a prime number. 


4. Prove Corollary 5.4.5. 


5. Prove Corollary 5.4.6. 


10. 


11. 


. (a) Prove that V3 is irrational. 


(b) Prove that V2 is irrational. 

(c) Prove that V2 is irrational for every n € Z, n = 2. 

(d) Prove that if p is a prime number, then Wp is irrational for every 
nEZ,n=2. 

(e) Letn, m € Z, n = 2. Prove that if m is not the nth power of an 
integer, then W/m is irrational. 


. (a) Prove that log;y 3 is irrational. 


(b) Prove that if r is a rational number such that r > 1 and r # 10” for 
any positive integer n, then logyr is irrational. 


. Write the following integers in standard form: 


(a) 594 (b) 1,400 
(c) 42,750 (d) 191,737 


. Letn € Z,n = 1. Prove that n is a perfect square if and only if, when 


n is written in standard form, all of the exponents are even. 


Let a, be Z. 
(a) Prove that if a? | b’, then a | b. 
(b) Prove that if a" | b" for some positive integer n, then a | b. 


(a) Let a, b © Z such that (a, b) = 1. Suppose that ab = x’ for some x 
in Z. Prove that a = y’ and b = 2? for some y and z in Z. 

(b) Show that part (a) is false without the assumption that a and b are 
relatively prime. 

(c) Let a, b € Z such that (a, b) = 1. Suppose that ab = x" for some x 
in Z and some positive integer 1. Prove that a = y" and b = 2” for 
some y and z in Z. 

(d) Leta, b,c € Z* such that (a, b) = (a,c) = (b, c) = 1. Suppose that 
abc = x* for some x in Z*. Prove that a, b, and c are all squares in Z*. 


5.4 


12. 


13. 
14. 
15. 


16. 


17. 


18. 


19. 


20. 
21. 


22. 


23. 


24. 


25; 
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(e) Let a), a),...,a, € Z* such that (a, a;) = 1 if i + j. Suppose that 
a\a,...a, = x? for some x in Z*. Prove that each q; is a square in Z*. 


Prove that if a positive integer of the form 2” + 1 is prime, then mis a 
power of 2. 


Prove that 2 is the only prime of the form n> + 1. 

Prove that if 2” — 1 is prime, then n is prime. 

Investigate the following statement: 

If n is any positive integer, then n? + n + 41 is always a prime number. 


If you think it is true, give a proof; if false, give a counterexample. 


Leta, be Zia>1,b>1. Leta = phi: pe:...+ pr and b = 
pi: pe: ...* pr, where pi, po, ..., Pp, are primes and m, and n, are 
nonnegative integers, fori = 1,2,...,7r. Let J, = min(m,, n;). Prove that 


(a, b) = p+ p3-...+ pt. 
Use Exercises 8 and 16 to find the greatest common divisor of 1,400 
and 42,750. 


Prove that if a is a positive integer of the form 4n + 3, then at least one 
prime divisor of a is of the form 4n + 3. 


Prove that if ais a positive integer of the form 3” + 2, then at least one 
prime divisor of a is of the form 3n + 2. 


Prove that there are infinitely many primes of the form 3n + 2,n € Z*. 


Prove that there are infinitely many primes of the form 6n + 5,n € Z’. 


Let n be a positive integer. Prove that the binomial coefficient ( >) iS 


divisible by every prime p such that n < p < 2n but is not divisible by p’. 


Let p be a prime number and ¢ a positive integer. Let a € Z. Suppose 
that a divides p'. Prove that a = p* forsome k € Z,1 Sk St. 


Let n, m € Z, (n, m) = 1. Suppose that d is a positive divisor of nm. 
Prove that there exist positive integers d; and d, such that d = d,d, where 
d, divides n and d) divides m. 


If nis a positive integer, let 7(n) denote the number of positive divisors 

of n. So, for example, 7(1) = 1, 7(2) = 2, 7(3) = 2, 7(4) = 3, 7(5) = 2, 

7(6) = 4. 

(a) Prove that if p is a prime number and ¢ is a positive integer, then 
(p')=rtt+1. 

(b) Let n, m € Z, (n, m) = 1. Prove that t(nm) = 7(n)7(m). 

(c) Letn € Z. Let n = php® ... p% be the prime factorization of n. 
Prove that 7(n) = (a; + 1)(a + 1)... (a, + 1). 
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(d) Prove that an integer n has an odd number of divisors if and only 


if n is a perfect square. 


Discussion and Discovery Exercises 


D1. 


D2. 


D3. 


D4. 


DS. 


D6. 


In Example 1, we proved that V2 is irrational. This fact was also proved in 
Example 9 of Section 1.4. Are the two proofs the same? If not, how do 
they differ? 


Let p, denote the nth prime number; that is, p, 2, Pr = 3, p3 = 5, 
Ps = 7,..., and so on. Let g, = pip2.p3... Pr + 1. So, for example, 
ga =24+1=3,q=2°:3+1=7,q,;=2:°3:54+1=3]1,...,andsoon. 
(a) Prove that any prime divisor of g,, must be greater than p,. 

(b) It seems that the q,s themselves are primes. Prove or disprove. 


The modern discipline of cryptography is concerned with the develop- 
ment of secret codes. These codes depend for their effectiveness on the 
use of large primes. One source of large primes comes from the so- 
called Mersenne primes. Mersenne primes, named after the 17th century 
mathematician and monk Marin Mersenne, are primes of the form 
2" — 1. Of course not all numbers of this form are prime, and in fact if 
2” — 11s prime, then the exponent n must itself be prime. Some Mersenne 
primes are: 3, 7, 31, and 127. 
(a) Prove that 2'? — 1 is a Mersenne prime. 

Hint: Exercise 2 of this section will help. 
(b) Prove that 2'' — 1 is not a Mersenne prime. 


Pythagoras and his followers were intrigued by the properties of integers. 
Certain numbers they designated as perfect numbers. These were num- 
bers whose proper divisors added up to the number itself. For example, 
6 is a perfect number because the proper divisors of 6 (that is, all divisors 
except 6), are 1,2, and 3 and 1 + 2 + 3 = 6. The next perfect number 
is 28. Euclid was able to prove that n is an even perfect number if and 
only ifn = 2?~'(2” — 1) where 2? — 1 is a Mersenne prime. No odd perfect 
numbers have been found and it is conjectured that there are none. 
(a) Verify by direct calculation that 28 is a perfect number. 
(b) Use Euclid’s Theorem to find two more perfect numbers. 
(c) Prove that every even perfect number is a triangular number. (See 
the Discussion and Discovery Exercises in Section 5.2.) 


Conjecture a formula for the sum of the first n consecutive odd cubes. 
Your formula should be of the form 1° + 33+ 5° +... + (2n - 1 = 
p(n) where p(n) is some polynomial expression in the variable n. Then 
prove your formula. 


Using the formula you proved in the previous exercise, prove that every 
even perfect number greater than 6 is the sum of the first n consecutive 
odd cubes for some positive integer n. 
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Congruences and Their Properties 


Consider the following algebra problem: are there any integers x and y such 
that x? = 4y + 3? One could plug in integers for x and y to see if a solution 
with small values exists. But this approach would be futile if in fact, as is 
the case here, there is no solution. To see that there is no solution, we can 
argue as follows. 

Since 4y + 3 is odd, x? is odd, and by Theorem 1.4.6, x must be odd also. 
Then we can write x = 2t + 1 for some tin Z. So x? = 4° + 4¢ + 1. It then 
follows that 4(¢? + t — y) = 2 or that ¢? + t — y = 3, which is impossible, 
since #? + t — y is an integer. Note that here we use the fact that only +1 
have multiplicative inverses in Z. (See Corollary 5.1.8.) 

If we analyze this proof, we see that it can be more succinctly stated as 
follows: if the square of an integer is divided by 4, the remainder will be 0 
or 1 and hence cannot be written in the form 4y + 3. 

In this section we introduce the language of congruences so that proofs 
like the one above can be written in a more systematic and efficient way. 
Recall that in Section 4.2, we defined the notion of congruence modulo n, 
where n is a fixed positive integer; that is, two integers a and b are congruent 
modulo n if the difference a — b is divisible by n. We write a = b(mod n). 

In the exercises of Section 4.2, you were asked to prove that congruence 
mod n defines an equivalence relation on Z. The following proposition is 
equivalent to that exercise. 


Let a, b, c € Z and letn € Z. 


1. a = a(mod n) 
2. If a = b(mod n), then b = a(mod n). 
3. If a = b(mod n) and b = c(mod n), then a = c(mod n). 


Many of the algebraic properties of congruences are similar to corre- 
sponding properties of equalities. More specifically, we have: 


Let a, b,c, d€ Zandletn € Z*. 


1. If a = b(mod n) and c = d(mod n), then a + c = b + d(mod n) and 
ac = bd(mod n). 
2. If ab = ac(mod n) and (a, n) = 1, then b = c(mod n). 


PROOF: 1. Suppose that a = b and c = d(mod n). Then n | a — b and 
n|c — d, so there exist integers s, t such that a — b = ns andc — d = nt. 

Hence (a + c) — (b + d) = (a— b) + (c — d) = n(s + 2), which implies 
thata + c= b+ d(mod n). 
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Similarly, ac — bd = ac — be + be — bd = (a — b)c + D(c — d) = 
n(sc + bt), so we have that ac = bd(mod n). 
The proof of 2 is left as an exercise. 


Notice that the cancellation property in 2 of Proposition 5.5.2 does not 
necessarily hold if a and n are not relatively prime. For example, (3)(5) = 
(3)(13)(Gmod 24) but 5 and 13 are not congruent mod 24. 

The following corollary of Proposition 5.5.2 can be proved by induction. 


If a = b(mod n), then a* = b*(mod n) for every positive integer k. 


PROOF: We leave it as an exercise. @” 


The following proposition is very useful for doing problems involving con- 
gruences. 


Letn € Z,n > 1. Ifa € Z, then a is congruent mod n to exactly one of the 
integers 0,1,2,...,n —1. 


PROOF: Let a € Z. By the Division Algorithm (Theorem 5.3.1), there exist 
q, rin Z such that a = nq + r, whereO Sr<n. 

Thus a — r is divisible by n, so a = r(mod n). Hence a is congruent to 
one of the integers 0, 1,2,...,n — 1. 

If a is also congruent to some s where 0 = s < n, then r = s(mod n). 
But since r and s are both less than n, their difference cannot be divisible 
by 1 unless r = s. Therefore a is congruent mod n to exactly one of the 
integers 0,1, 2,...,n —1. 


We will show that the square of any integer is congruent to 0 or 1 mod 4. 
Let x € Z. By Proposition 5.5.4, x = 0, 1, 2, or 3(mod 4). By Corollary 5.5.3, 
x? = 0?, 12, 22, or 3°(mod 4). But 0? = 0, 1? = 1, 2? = 0, 3? = 1(mod 4). 
Therefore, by the transitive property of congruences, x? = 0 or 1(mod 4). 
One way to interpret this result is that the square of any integer leaves a 
remainder of 0 or 1 when divided by 4. 

The importance of the congruence approach is that the proof reduces to 
considering just four cases. In general, working with a congruence mod n 
will involve working with only the n integers 0, 1, 2,...,” — 1. 


We can now use congruences to solve the problem stated at the beginning 
of this section; namely, to show that there are no integers x and y such that 
x= 4y + 3. 

Suppose such integers x and y exist. Then x? — 3 is divisible by 4 or, in 
other words, x? = 3(mod 4). But the previous example shows that x? = 0 or 
1(mod 4). Hence we have a contradiction. So we can conclude that no such 
integers exist. 
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For each a € Z, we let [a] be the equivalence class of a with respect to 
the equivalence relation of congruence mod n: [a] = {x € Z| x = a(mod n)}. 

We will call [a] the congruence class of a mod n. 

By Proposition 5.5.4, if a © Z, then a = r for exactly one integer r such 
that 0 =r =n — 1. Therefore a € [r]. Thus there are exactly n distinct 
congruence classes; namely, [0], [1], [2],. . ., [” — 1]. To see which equivalence 
class a given integer a is in, we just compute the remainder when a is divided 
by n. 


When n = 2, there are two congruence classes: [0] = {x € Z| x = 0(mod 
2)} is the set of even integers and [1] = {x € Z | x = 1(mod 2)} is the set of 
odd integers. 


When n = 3, there are three congruence classes: 


[0] = {x € Z| x = 0(mod 3)} is the set of multiples of 3; 

[1] = {x € Z| x = 1(mod 3)} is the set of integers that leave a positive 
remainder of 1 when divided by 3; and 

[2] = {x € Z| x = 2(mod 3)} is the set of integers that leave a positive 
remainder of 2 when divided by 3. 


Notice that these three classes are distinct and that every integer is in 
exactly one of them. 


Let n = 6. Then there are six congruence classes: [0], [1], [2], [3], [4], and 
[5]. Let a = 74. Since the remainder when 74 is divided by 6 is 2, it follows 
that 74 € [2]. By the transitive property of congruences, any integer congruent 
to 74 mod 6 will be congruent to 2 mod 6. Similarly, any integer congruent 
to 2 mod 6 will be congruent to 74 mod 6. Hence [74] = [2]. 


The previous example shows that there is more than one way to represent 
a congruence class. For n = 6, [2] = [74]. Similarly, [2] = [8] = [14] = [20] 
and [3] = [9] = [15] = [21]. For n = 13, [4] = [17] = [30] = [43]. 

In general, for congruences mod n, [a] = [b] if and only if a = b(mod 
n). We leave the proof of this fact as an exercise. 


Reminders: 


1. For any integer a, [a] is a set, not an integer. 

2. If 0 = a <n, then [a] can be described as the set of integers that 
give a remainder of a when divided by n. 

3. If [a] = [b], it does not mean that a = 5, only that a = b(mod 
n) or that a and b give the same remainder when divided by n. 
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The Set of Congruence Classes 


Forn € Z,n > 1, we will let Z, denote the set of congruence classes mod 
n. Thus Z, = {[0], [1], [2], ..., [2 — 1}. 

In Chapter 4 we discussed the notion of a binary operation on a set; that 
is, a rule for combining two elements of a set to give a third element of the 
set. There are two important binary operations on the set Z,. We define 
them next. The first one will be called addition because it depends on ordinary 
addition of integers and the second will be called multiplication because it 
depends on ordinary multiplication of integers. 

First we define addition on Z, using the symbol + as follows: if [a], 
[b] € Z,, then [a] + [b] = [a + 5], for any a, b € Z. 

So, for example, if n = 7, [4] + [2] = [6] and [5] + [4] = [9] = [2]. 

An apparent problem with this definition is that it seems to depend on 
the choice of the integers a and b to represent the equivalence classes [a] 
and [b]. If this definition gives a real binary operation, then the sum of the 
classes [a] and [b] should be the same if a and 5b are replaced by other 
integers representing the same classes. In that case we can say that addition 
is well defined. 

For example, suppose n = 6. Then [3] + [5] = [8] = [2]. But we can also 
write [3] = [15] and [5] = [47] so adding [15] and [47] should give the same 
answer as adding [3] and [5]. We check: [15] + [47] = [62] = [2]. 

More generally, we must show that if [a] = [a’] and [b] = [b’], then 
[a + b] = [a’ + 6’). But if [a] = [a’] and [b] = [b’], then a = a’ and b = 
b'(mod n). Now applying Proposition 5.5.2, we get thata + b=a' + 
b'(mod n), which implies that [a + b] = [a’ + b']. We have now proved 
that addition on Z,, is well defined. 

The next theorem gives some important algebraic properties of addition 
in Z,. 


LetnE Zn> 1. 


1. Addition in Z,, is commutative: [a] + [b] = [b] + [a] for all a, b € Z. 
2. Addition in Z, is associative: ([a] + [b]) + [c] = [a] + ([b] + [c]) for 
alla, b,c € Z. 
3. [0] is the identity element of Z,, with respect to addition: [a] + [0] = 
[a] = [0] + [a] for alla € Z. 
4. Every element of Z, has an inverse with respect to addition. For any 
ain Z, the additive inverse of [a] is [—a]. 
PROOF: 1. Let a, b € Z. Then [a] + [b] = [a+ b] = [b + a] = [b] + [a]. 
Note that in the step [a + b] = [b + a], we use the fact that addition in Z 
is commutative. 
The proofs of 2, 3, and 4 are left as exercises. “” 


Next we define multiplication on Z, by letting [a][b] = [ab], for all a, 
bEZ. 
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As with addition, it is necessary to show that this binary operation is well 
defined. We leave the proof as an exercise. 


LetnE€ Zn> 1. 


1. Multiplication in Z, is commutative: [a][b] = [b][a] for all a, b € Z. 

2. Multiplication in Z, is associative: ([a][b])[c] = [a]((b][c]) for all a, b, 
ceEZ. 

3. [1] is the multiplicative identity of Z,: [a][1] = [a] = [1][a] for all 
acZ. 

4. The following distributive laws hold: [a]({b] + [c]) = [a][b] + [a][c] 
and ({a] + [b])[c] = [a][c] + [b][c] for all a, b, c € Z. 


PROOF: We prove 2 and leave the rest as exercises. If a, b, c € Z, ({a][b]) 


[c] = [ab]lc] = [(ab)c] = [a(be)] = [a][bc] = [a]({b][c]). 


Because Z,, is a finite set, it is possible to give addition and multiplication 
tables for Z, when n is small. Following are the tables for Z; and Ze: 


Zs 
+ [0] {1] [2] [3] [4] 
[0] [0] (1) [2] [3] [4] 
[1] 0) [2] [3] [4] [0] 
[2] [2] [3] [4] [0] (1) 
[3] [3] [4] [0] (1) [2] 
[4] [4] [0] [1] [2] [3] 


; [0] [1] [2] [3] [4] 
[0] [0] [0] [0] [0] [0] 
[1] [0] [1] [2] [3] [4] 
[2] [0] [2] [4] (1) [3] 
[3] [0] [3] 01) [4] [2] 
[4] [0] [4] [3] [2] (1) 


Zs 

- (0) [1] [2] [3] [4] [5] 
[0) [0] (1) [2] [3] [4] [5] 
[1] (1) [2] [3] [4] [5] [0] 
[2] [2] [3] [4] [5] [0] (1) 
[3] [3] [4] [5] [0] 01) [2] 
[4] [4] [5] [0] (1) [2] [3] 
[5] [5] [0] [1] [2] [3] [4] 
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(0) [1] [2] [3] [4] [5] 
(0) [0] [0] [0] [0] [0] [0] 
[1] [0] (1) [2] [3] [4] [5] 
[2] [0] [2] [4] [0] [2] [4] 
[3] [0] [3] [0] [3] [0] [3] 
[4] [0] [4] [2] [0] [4] [2] 
[5] [0] [5] [4] [3] [2] [1] 


Unlike addition, not every element of Z,, has an inverse with respect to 
multiplication. An element [a] has an inverse with respect to multiplication 
if there exists [x] in Z,, such that [a][x] = [1]. 

From the multiplication table for Z;, we see that each element except 
(0] has a multiplicative inverse. For example, (2]~! = [3] and [4]! = [4]. 

In Z, however, only [1] and [5] have multiplicative inverses and 
[5}" = [5]. 

The following theorem gives an explicit criterion for an element of Z, 
to have a multiplicative inverse. 


Let [a] € Z,,. Then [a] has a multiplicative inverse if and only if a and n are 
relatively prime. 


PROOF: If [a] has a multiplicative inverse, then there exists an element 
[x] € Z, such that [a][x] = [1]. It follows that [ax] = [1] and hence ax = 
1(mod n). (Why?) 

This implies that n divides ax — 1, which means that ax — 1 = nt for some 
t € Z. Hence ax + n(—t) = 1. By Theorem 5.3.8, a andn are relatively prime. 

Conversely, if a andv are relatively prime, again by Theorem 5.3.8, there 
exist integers x and y such that ax + ny = 1. Hence [1] = [ax + ny] = 
[ax] + [ny] = [a][x] + [n][y]. 

But[n] = [0], so we have [a][x] = [1], and therefore [x] isthe multiplicative 


The proof of Theorem 5.5.7 tells us how to find the multiplicative inverse 
of an element of Z,; namely, [x] is the inverse of [a] if and only if x is a 
solution of the congruence ax = 1(mod n). The next theorem gives us a 
procedure for solving such congruences. 


Leta, b,n € Z,n > 1, and suppose that (a, n) = 1. Then the congruence 
ax = b(mod n) has a unique solution mod n; that is, there is an integer x 
such that ax = b(mod n), and if y is another integer such that ay = b(mod 
n), then y = x(mod n). 


PROOF: Since (a, n) = 1, there are integers t and s such that as + nt = 1. 
Multiplying by b, we get asb + ntb = b. Letting x = sb, we have ax = b(mod 
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n). Ifay = b(mod n) for some y in Z, then ax = ay(mod n), and by Proposition 
5.5.2, x = y(mod n). © 


Notice that if x is a solution to the congruence ax = b(mod n), then the 
set of all solutions to the congruence is the congruence class [x]. 


We solve the congruence 11x = 9(mod 29). The proof of Theorem 5.5.8 tells 
us how to proceed. We first must find integers s and ¢ such that as + nt = 
1. The procedure for finding such s and t was described in Example 5 o 
Section 5.3. In this case, we get s = 8 and t = —3. Therefore, x = sb = 
(8)(9) = 72, and so any integer congruent to 72(mod 29) is a solution of 
the congruence. 

To find the least positive solution, we merely compute the remainder 
when 72 is divided by 29. Thus the least positive solution to the congruence 
11x = 9(mod 29) is 14. 

The set of all solutions to the congruence is the congruence class [14] 
in Zn. 


=> 


In Z,, the elements with multiplicative inverses are [1], [5], [7], and [11]. 
To find the inverse of [7], for example, we must solve the congruence 7x = 
1(mod 12). Since 1 = 12(3) + 7(—5), x = —5 = 7(mod 12) is a solution. 
Hence [7]! = [7]. 


4 MATHEMATICAL PERSPECTIVE: FERMAT’S LITTLE THEOREM 


One of the most famous theorems on congruences was proven by Pierre de 
Fermat (1601-1665). Fermat was a French judge who worked as a Councilor 
in the parliament at Toulouse. But his great love was mathematics, especially 
number theory. Although Fermat was only an ‘“‘amateur’’ mathematician, 
he formulated some very sophisticated theorems. With one exception, he 
did not give proofs of his theorems but he undoubtedly knew how to prove 
many of them. However, some of his results were so difficult that they were 
only proved years later by mathematicians such as Euler. He also made some 
claims that he was not able to prove or that were false. For example, Fermat 
claimed that integers of the form 2° + 1 were always prime. Numbers of 
this form have come to be known as Fermat numbers. It is easy to check 
that 2?" + 1 is prime for n = 0, 1, 2, 3, 4. But 2 + 1 is not prime since it is 
divisible by 641. In fact it is known that 2?" + 1 is not prime for 5 <= n = 22. 
Moreover, there are no known values of n > 4 for which 2” + 1 is prime. 

Fermat’s theorem on congruences, sometimes referred to as Fermat’s 
Little Theorem, says that if 1 is any integer and p is a prime number, then 
n’ — nis divisible by p. In congruence notation, it says that n? = n(mod p). 

For example, 5’ — 5 must be divisible by 7. Indeed, 5’ — 5 = 78120 = 
(7)(11160). 
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A corollary of Fermat’s Little Theorem is that if n is relatively prime to 
p, then n?~' = 1(mod p). This corollary can be used to find the remainder 
when certain large powers are divided by a prime. 


Find the remainder if 57° is divided by 11. By the corollary to Fermat’s 
Little Theorem, 5° = 1(mod 11). Since 213 = (10)(21) + 3, we get 5°? = 
(5!°)7153 = 5 = 125 = 4(mod 11). Therefore, the remainder is 4. 


The proof of Fermat’s Little Theorem can be done by induction. We 
leave the proof and the proof of the corollary as exercises. 

An interesting problem is to consider the converse of Fermat’s Little 
Theorem. Its statement would be: if k is a positive integer and if n* = n(mod 
k) for all integers n, then k is a prime. Equivalently, this could be stated in 
its contrapositive form: if k is not a prime, then there exists an integer n 
such that n‘ is not congruent to n mod k. 

How does one go about proving or disproving this converse? If no proof 
presents itself immediately, one might try different cases. This would mean 
that for a given composite number k, we would need to find an integer n 
such that * is not congruent to n mod k. Let’s try some. 


ll 


4: 2‘ is not congruent to 2 mod 4. 
6: 2° is not congruent to 2 mod 6. 
8: 2° is not congruent to 2 mod 8. 
9: 
1 


2° is not congruent to 2 mod 9. 
0: 2! is not congruent to 2 mod 10. 


a ae 
ll 


We see from these examples that not only does the converse of Fermat’s 
Little Theorem hold in these cases but we don’t have to look far to find the 
integer n. In all of these cases, 2‘ is not congruent to 2 mod k. Might it be 
the case that for every composite number k, 2‘ is not congruent to 2 mod 
k? Unfortunately, this is not the case since 2%! = 2(mod 341) and 341 = 
(11)(31) is not prime. From Exercise 29 of this section, to see that 2! = 
2(mod 341), it suffices to show that 2°! = 2(mod 11) and 2*" = 2(mod 31). 
Toprove each of these congruences Fermat’s Little Theorem and its corollary 
can be used. We leave the details as an exercise (Exercise 30(a)). 

But 341 does not give a counterexample to the converse of Fermat’s 
Little Theorem since 3*' is not congruent to 3 mod 341. It is not until we 
consider 561 = (3)(11)(17) that we arrive at the correct counterexample. It 
can be shown that for every integer n, n**! = n(mod 561). So indeed the 
converse of Fermat’s Little Theorem is false. 

Numbers k with the property that n* = n (mod k) for all integers n are 
called Carmichael numbers. Other Carmichael numbers are 1105, 1729, and 
2821. Carmichael numbers occur infrequently. There are only 43 of them 
less than one million. However, it was proved in 1992 that there are infinitely 
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many Carmichael numbers. A description of this last result can be found in 
What’s Happening in the Mathematical Sciences 1993. [2] 


Exercises 5.5 


1. 


13. 


14. 
15. 


Let x € Z. 

(a) Prove that if x is even, then x? = 0(mod 4). 

(b) Prove that if x is odd, then x? = 1(mod 4). 

(c) State the converses of parts (a) and (b). Prove or disprove each. 


. Let x € Z. 


(a) Prove that if x is even, then x? = 0 or 4(mod 8). 
(b) Prove that if x is odd, then x? = 1(mod 8). 


. (a) Prove that there are no integers x and y such that x? = 8y + 3. 


(b) Prove that there are no integers x and y such that x? = 5y + 2. 


. Prove that there are no integers x, y, z such that x? + y? = 8z + 3. 
. Prove that there are no integers x and y such that x? + 2y? = 805. 
6. 


Prove that if n © Z and n = 2(mod 3), then there are no integers x and 
y such that n = x? + 3y?. 


. (a) Let x be an odd positive integer. Prove that if x = 7(mod 8), then 


x cannot be written as the sum of three integer squares. 


(b) State the converse of the implication in part (a). This converse is 
true but its proof is difficult. 


. Find the congruence class in Z,, of each of the following integers. Write 


your answer in the form [x] where 0 = x = 16. 
(a) 29 (b) 166 (c) 10,227 


. Prove part 2 of Proposition 5.5.2. 
. Prove Corollary 5.5.3. 
. Prove that in Z,, [a] = [b] if and only if a = b(mod n). 


. (a) Prove that addition in Z, is associative. 


(b) Prove that [0] is the additive identity of Z,,. 
(c) Prove that every element of Z, has an additive inverse. 


(a) Prove that multiplication in Z,, is well defined. 
(b) Prove that multiplication in Z, is commutative. 
(c) Prove that [1] is the multiplicative identity of Z,,. 


Prove the distributive laws in Z,, as stated in part 4 of Theorem 5.5.6. 


Compute the following for the given Z,. Write your answer in the form 
{a] where OS asn-1. 

(a) [21] + [19] in Zi 

(b) [34][27] in Zs 

(c) [33]}([83] — [67]) in Zio 
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16. 


17. 


18. 


19. 


20. 


21. 


22. 


23: 


24. 


25. 


26. 


27, 


28. 


29. 


(d) The additive inverse of [18] in Zo. 


Find the least positive integer x that satisfies the following congruences: 
(a) 6x = 57(mod 23) 

(b) 90x = 41(mod 73) 

(c) 11x = 211(mod 571) 

Find the multiplicative inverses of the following elements: 

(a) [8] in Z,, (b) [11] in Zs, 

(c) [9] in Zi6 (d) [201] in Z614 

Write out the addition and multiplication tables for Z,, Zs, and Z). Then 
list the elements in each of these sets that have multiplicative inverses. 


Letn € Z,n > 1. Prove that every nonzero element of Z,, has a multiplica- 
tive inverse if and only if n is prime. 


Let n € Z,n > 1. Let a € Z such that a = 1(mod n). Prove that 
[a]* = [a]. 

Let a, b,n © Z,n > 1. Prove that if a = b(mod n), then a = b(mod m) 
for every min Z, m > 1, such that m divides n. 


Let a, x, y,n © Z,n > 1. Prove that if ax = ay(mod n), then x = 
y (moa “), where d = (a, n). 
Hint: See Exercise 11 of Section 5.3. 


Let p be a prime number and let a, b € Z. 
(a) Suppose that a? = b*(mod p). Prove that a = +b(mod p). 
(b) Does the result of part (a) hold if p is not prime? Prove your answer. 


Letn € Z,n > 1. Without using any results on equivalence classes from 
Chapter 4, prove that 

(a) Z is the union of the distinct congruence classes mod n. 

(b) Any two distinct congruence classes mod n are disjoint. 


Let p be a prime. Prove that the binomial coefficient 2 is divisible by 


p for every integer k such thatl =k=p-—1. 


Prove Fermat’s Little Theorem; that is, if p is a prime, then for every 
integer n, n’ — nis divisible by p. 

Hint: First use induction to prove it for all positive integers n. The Binomial 
Theorem and the previous exercise will be help ful. 


Prove the corollary to Fermat’s Little Theorem: if p is a prime and n is 


an integer relatively prime to p, then n?! = 1(mod p). 


(a) Find the remainder if 19° is divided by 13. 
(b) Find the remainder if 11?! is divided by 35. 


Letn, m € Z* anda, b € Z. Suppose that a = b(mod n) and a = b(mod 
m) and that (n, m) = 1. Prove that a = b(mod nm). 
Hint: See Exercise 13 of Section 5.3. 


DD. 


30. 


31. 


32. 
33. 
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(a) Prove that 2%! = 2(mod 341). 
b) Prove that 3°” is not congruent to 3 mod 341. 
g 


Suppose that k = p,p) ... p, where the ps are distinct primes and 
r > 1. Suppose also that for each i = 1, 2,...,7r, p; — 1 divides k — 1. 
Prove that k is a Carmichael number. 


Verify that 1105 and 1729 are Carmichael numbers. 


Let f, = 27" + 1 be the nth Fermat number. 

(a) Prove by induction that for alln = 1 fififi...fr-=f, — 2. 

(b) Prove that if m and m are distinct positive integers, then f, and f,, 
are relatively prime. 

(c) Use part (b) to give another proof that there are infinitely many 
primes. 


Discussion and Discovery Exercises 


D1. 


D2. 


D3. 


Back in the 17th century, Fermat considered the problem of determining 

which positive integers can be written as the sum of two squares. For 

example, 2 = 17 + 17,5 = 2? + 17,8 = 2? + 27, 10 = 3° + 1’, and so 
forth. But not all integers can be written as the sum of two squares. It 
is easy to see, for example, that 3 and 7 cannot. In a letter to Mersenne, 

Fermat claimed that he had a proof that any prime congruent to 1 mod 

4is asum of two squares. The first published proof of Fermat’s Theorem 

was given by Euler about one hundred years later. 

(a) Verify Fermat’s Theorem for primes up to 37. 

(b) Prove the converse of Fermat’s Theorem; that is, if p is an odd 
prime and p is a sum of two squares, then p = 1(mod 4). 

(c) List the first 20 integers, not necessarily primes, that are the sum 
of two squares. 

(d) Based on your answer to part (c), state a theorem that characterizes 
all integers that are the sum of two squares. 

(e) Find a positive integer that can be written as the sum of two squares 
in two different ways. (Note: they must truly be different ways. 
For example, if n = x? + y?, then don’t consider the equations 
n=y +x? orn = (+x) + (+y) as new representations.) 


Explain in your own words the meaning of the set Z,. Make your 
explanation clear enough to be understood by a group of freshman 
math majors who know what the set of integers is but not much else 
about it. You should be somewhat informal in your writing, but above 
all be clear and write only complete sentences. You may use Z, as an 
example (but only as an example) if you feel it will help to make your 
explanation clearer. 


Following the general instructions of Exercise D2, explain what it means 
for addition in Z, to be well defined. 
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One way that mathematicians have been able to discover new facts is by 
extending or generalizing the results of a known theorem by conjecturing and 
then proving a result that verifies more cases of the known theorem or that 
proves a theorem of which the known theorem is a special case. For example, 
Euclid’s Theorem that there exist infinitely many prime numbers was general- 
ized by Dirichlet when he proved that there are infinitely many primes in any 
arithmetic progression. (See the comments at the end of Section 5.4.) 

In this section, we illustrate this approach to discovering new facts by 
looking at Theorem 1.4.4, which says that for every integer n, n is even if 
and only if n? is even. 

One way to extend this result would be to consider other exponents. For 
example, is it true that for every integer n, n is even if and only if n> is even? 
Let’s try a proof. It would be helpful to reread the proof of Theorem 1.4.4, 
since it seems reasonable to try a similar proof to that one. 

We will first prove that if n is even, then n> is even. So let n be an even 
integer. Then n = 2¢ for some integer ¢. Thus 


n= (2t) = 8 = 2(40°), 


which shows that 7? is even. 

We now wish to prove (or disprove) the converse; namely, if n> is even, 
then n is even. Recall that in Section 1.4, when we proved that if n’ is even 
then n is even, we actually proved the contrapositive of the converse; namely, 
if n is odd, then n? is odd. It makes sense to try that approach again. We 
will prove that if n is odd, then n° is odd. 

Suppose that n is an odd integer. Then n = 2t¢ + 1 for some integer ¢. 
We get 


n= (2+ 1) = 2(48 + 62? + 32) + 1. 


Hence n? is odd. We have now established that if 13? is even, then 7 is 
even. We may summarize our results in a proposition. 


For every integer n, n is even if and only if n° is even. 


It is possible to prove that n? even implies n even directly without proving 
the contrapositive. However, such a proof does require using some results 
from Section 5.4. We leave the solution as an exercise. 

The next logical extension of Theorem 1.4.4 is to exponent 4 and indeed 
it is true that for every integer n, n is even if and only if n‘ is even. The 
proof is similar to the proof of Proposition 5.6.1 and is left as an exercise. 

Now instead of trying more exponents, suppose we try to state and prove 
a general theorem that includes the ones we have already done. The obvious 
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generalization would be the following: if k is a positive integer, then for 
every integer n, n is even if and only if n‘ is even. 

A natural way to start a proof is to try to mimic the proof of Theorem 
1.4.4 or Proposition 5.6.1. We first prove that if m is an even integer, then 
n* is even. If n = 2t for some integer 4 then n* = (2r) = 2-2*'r*, an 
even number. 

To prove the converse, we will again mimic the proof of Theorem 1.4.4 
and prove its contrapositive; namely, if 1 is odd, then n‘ is odd. Let n be an 
odd integer. Then we can write n = 2t + 1, where t € Z. So n* = (2t + 1). 

Here we pause because our exponent k is unspecified. If we are to follow 
the proof of Theorem 1.4.4, we need to expand (2r + 1)‘. Fortunately, the 
Binomial Theorem (Theorem 5.2.7) will allow us to do that. We get 


k 
Q+1¥=> (*) Qn, 
i=0 

Each term of this sum except the last one has a factor of 2 in it and the 
last term is 1. Thus (2t + 1)* can be expressed as 2m + 1 for some integer 
m, proving that n* is odd. 

Our proof is now complete and we have the following result that general- 
izes Theorem 1.4.4 and Proposition 5.6.1. 


Let k be a positive integer. Then for every integer n, n is even if and only 
if n* is even. 


In proving Theorem 1.4.4 and Proposition 5.6.1, we needed to use only 
simple algebraic properties of the integers. But in proving the more general 
result in Theorem 5.6.2, we needed to use the Binomial Theorem. It is usually 
the case in mathematics that extending a theorem often requires using more 
sophisticated methods or additional theorems and lemmas. 

Theorem 5.6.2 can also be proved by induction. The details are left as 
an exercise. 

There is another way to extend the result of Theorem 1.4.4. To say that 
an integer n is even is equivalent to saying that it is divisible by 2. So it is 
natural to ask whether Theorem 1.4.4 can be extended to divisibility by 3; 
that is, is it true that for every integer n, n is divisible by 3 if and only if n° 
is divisible by 3? (We will stick with n’ for now and consider higher powers 
of n later.) 

To start out at least, let’s try following the proof of Theorem 1.4.4 since 
that approach has proved successful so far. The proof that if n is divisible 
by 3, then n’ is divisible by 3 is straightforward and is left as an exercise. 

Next we want to show that if n? is divisible by 3, then x is divisible by 3. 
Asin the proof of Theorem 1.4.4, we will prove the contrapositive; namely, 
if n is not divisible by 3, then n’ is not divisible by 3. 

In the case of divisibility by 2, to say 1 is not divisible by 2 means that 
nis odd or, equivalently, m = 2t + 1 for some integer ft. What is a comparable 
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statement for 1 to be not divisible by 3? Here the Division Algorithm helps. 
One consequence of it is that for every integer n, there exist integers ¢ and 
r such that n = 3t + r, where 0 =r < 3. In other words, we can write n = 
3t,n = 3t + 1, orn = 3t + 2, for some integer t¢. So if 1 is not divisible by 
3, then nm = 3t + 1 orn = 3t + 2 for some integer f. 

We have two cases to consider then. If n = 3t + 1, then 


nm = (3¢4+ 1% = 92 + 64+ 1 = 332? +241 
and if nm = 3t + 2, then 
n= (3¢ + 2 = 90 + 120+ 4 = 330 + 4¢+ D411. 


In both cases, we see that n’ is not divisible by 3. Thus we have proved 
the following: 


For every integer n, n is divisible by 3 if and only if n? is divisible by 3. 


Note: It is possible to prove directly, without using the contrapositive, 
that if n? is divisible by 3, then n is divisible by 3. (See Exercise 5.) 

The next case to consider is divisibility by 4. A quick look at examples 
and we see that the statement ‘“‘For every integer n, n is divisible by 4 if and 
only if n? is divisible by 4” is false. Let n = 2. Then rn’? = 4 is divisible by 4 
but 7 is not. 

So we are led to the following question: For what positive integers k is 
it true that for every integer n, n is divisible by k if and only if n’ is divisible 
by k? 

It is easy to see that one of the implications is true for every positive 
integer k; namely, for every integer n, if n is divisible by k, then n’ is divisible 
by k. We leave the proof as an exercise. 

It is the converse that is interesting. Let S(k) be the statement “‘For every 
integer n, if n’ is divisible by k, then n is divisible by k.’’ We would like to 
know for which positive integers k the statement S(k) is true. 

We know that S(k) is true for k = 1 (trivially), for k = 2 (Theorem 1.4.4), 
and for k = 3 (Proposition 5.6.3). It is false for k = 4. 

Since 2 and 3 are primes, we might conjecture that S(k) is true if k is 
prime and false if k is not prime. The following proposition states that S(k) 
is true if k is a prime. The proof is left as an exercise. A word of caution 
first. Unlike the proofs of Theorem 1.4.4 and Proposition 5.6.3, it is best to 
prove Proposition 5.6.4 directly and not prove its contrapositive. One or 
more results on primes from Section 5.4 will be helpful here. 


If k is a prime number, then for every integer n, if n’ is divisible by k, then 
n is divisible by k. 


Now what about composite values of k? We have only shown that S(k) 
is false for one value of k, k = 4. It would help to reinforce our conjecture 
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by looking at one or two more examples. Let’s try k = 6. If we compute the 
squares of all positive integers up to 50, we see that the only ones divisible 
by 6 are 36, 144, 324, 576, 900, 1296, and 1764. Each one of these squares 
is the square of a number divisible by 6. It would seem then that S(6) may 
very well be true. Let’s see if we can prove it. 

One approach is to prove the contrapositive of 5(6) as was done in the 
proofs of Theorem 1.4.4 and Proposition 5.6.3. We leave this method of 
proof as an exercise and instead prove S(6) directly. Suppose that n is an 
integer such that n’ is divisible by 6. It follows that n’ is divisible by 2 and 
by 3. Now since S(2) and S(3) are true, it follows that n is divisible by 2 and 
by 3. Because 2 and 3 are relatively prime, we can use Exercise 13 of Section 
5.3 to conclude that n is divisible by 6. So now we have the following: 


For every integer n, if n’ is divisible by 6, then n is divisible by 6. 


So now we have to revise our conjecture. We know that S(k) is true if 
k is prime but not always false if k is not prime. The next step might be to 
generalize Proposition 5.6.5. If we study the proof of Proposition 5.6.5, we 
see that the roles played by 2 and 3 could be replaced by any pair of distinct 
primes. Hence, we have the following result and again we leave the proof 
for the exercises. 


Let p and q be distinct primes and let k = pq. For every integer n, if n°’ is 
divisible by k, then n is divisible by k. 


Next, let’s consider how we may generalize Proposition 5.6.6 even more. 
A reasonable generalization would be to look at the product of more than 
two distinct primes. In fact, S(k) is true whenever k is a product of distinct 
primes. A rigorous proof will require a suitable generalization of Exercise 
13 of Section 5.3. In fact, a generalization of Exercise 12 of Section 5.3 is 
required first. We state these generalizations as lemmas and of course leave 
the details to you. Both proofs should be done by induction. 


Let a,, a, ..., Gn, b © Zand suppose that (a,, b) = (m, b) =... = 
(an, b) = 1. Then (aya)... Gn, b) = 1. 


Let a, a ..., G,, b € Zand suppose that a; | b for alli = 1, 2,..., m. 
Suppose also that (a;, a)) = 1 if i # j. Then aya)... am | b. 


Now you should be able to prove the following. (See Exercise 12.) 


Let pi, Po, ---, Pm be distinct prime numbers. Let k = pip)... Pm. For every 
integer n, if n’ is divisible by k, then n is divisible by k. 


Is it possible that S(k) is true for other k besides those k that are the 
product of distinct primes? We’ll try some examples. 
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S(8) is false since 4? is divisible by 8 but 4 is not. Similarly, 5(9) is false 
since 3° is divisible by 9 but 3 is not. In fact, S(k) is false for kK = 12, 16, and 
18 as well. (See Exercise 13.) 

Perhaps then Proposition 5.6.9 gives all values of k = 2 for which S(k) 
is true. This is indeed the case. 


S(k) is false if k is not the product of distinct primes. 


PROOF: We leave this as an exercise. (You may find it helpful to prove 
special cases of this result first. For example, first prove that S(k) is false if 
k = p’, where p is a prime, and then if k = p”, m = 2.) ©» 


We summarize these results in the following theorem. 
If k = 2, then S(k) is true if and only if k is a product of distinct primes. 


We conclude with one more generalization. If k and tare positive integers, 
let S(k, t) be the statement ‘“‘for every integer n, n is divisible by k if and 
only if ' is divisible by k.”’ In Theorem 5.6.3, we proved that S(2, f) is true 
for all positive integers t. In Theorem 5.6.11, we proved (or rather you will 
prove) that S(k, 2) is true if and only if k is a product of distinct primes. 

As a final exercise, formulate and prove the appropriate theorem for 
S(k, t) in general (Exercise 15). 


a” HISTORICAL COMMENTS: FERMAT’S LAST THEOREM 


One of the most famous generalizations in all of mathematical history con- 
cerns Pythagorean triples. As we saw in Section 5.3, the equation x? + y? = 
z’ has infinitely many solutions in positive integers. One generalization of 
this result would be to look at the equation x? + y? = z? and ask whether 
there are any solutions to this equation in positive integers. That equation 
and the more general one x" + y” = z" have a fascinating history. 

A Latin translation of Diophantus’ Arithmetica was published by Claude 
Gaspard de Bachet (1581-1638) in 1621. A copy of this translation became 
available to Fermat. In a page margin of his copy of Bachet’s translation, 
Fermat claimed that the equation x” + y” = z” has no solution in positive 
integers. He wrote that he had a wonderful proof of this fact but that the 
margin was too small to contain it. No record of Fermat’s ‘‘proof’’ has ever 
been found. Fermat’s claim became known as Fermat’s Last Theorem and 
its history comprises one of the most interesting stories in all of mathematics. 

With one exception, Fermat never published any proofs of his work. That 
exception was a proof that the equation x‘ — y’ = z? has no solutions in 
positive integers. An easy corollary of this result is the n = 4 case of Fermat’s 
Last Theorem. Fermat’s proof is elementary enough to be included in most 
introductory books on number theory. The case n = 3 of Fermat’s Last 
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Theorem was essentially proved by Euler. Euler’s proof is not elementary 
in the sense that it requires more sophisticated techniques than the n = 4 
case requires. 

One can show that if Fermat’s Last Theorem is true for odd prime 
exponents p and also for n = 4, then it is true for every exponent n. (See 
Exercise 17 following.) So the challenge was to prove that if p is an odd 
prime then the equation x? + y’ = 2? has no positive integer solutions. 

In the 1820s, independent proofs of Fermat’s Last Theorem for p = 5 
were given by both Dirichlet and Adrien-Marie Legendre (1752-1833) and 
in 1839, Gabriel Lame (1795-1870) gave a proof for p = 7. 

The most significant work on Fermat’s Last Theorem in the 19th century 
was done by Ernst Kummer (1810-1893) and the Bernoulli numbers that 
we discussed in Section 5.2 played an important role. In 1847, Kummer 
proved that if p is an odd prime and p does not divide the numerator of any 
of the Bernoulli numbers B,, B,,..., B,-3, then Fermat’s Last Theorem is 
true for the prime p. 

Primes that satisfy the conditions of Kummer’s Theorem are called regular 
primes. A prime that is not regular is called irregular. It is easy to check 
that 3, 5, and 7 are regular primes. In fact, the first irregular prime is 37 
since the numerator of By is 7709321041217, which is divisible by 37. In a 
random list of primes, regular primes seem to occur roughly three-fifths of 
the time. But, oddly enough, although it is known that there are infinitely 
many irregular primes, it is not known if there are infinitely many regular 
primes. 

Over the next 100 years or so various methods were developed to prove 
special cases of Fermat’s Last Theorem, so that by 1975 it was established 
to be true for all prime exponents less than 125,000. These methods centered 
on analyzing what are called cyclotomic integers, which are complex numbers 
involving roots of unity. (Complex numbers and roots of unity are discussed 
in Section 7.3.) 

Also over the years, many mathematicians, famous and amateur alike, 
thought that they had proven Fermat’s Last Theorem. But until 1994, every 
one of these proofs had a flaw in it. 

Then in 1986 the history of Fermat’s Last Theorem took a surprising 
turn. A German mathematician named Gerhard Frey published a paper in 
which he claimed that Fermat’s Last Theorem would follow from a conjecture 
about elliptic curves. Elliptic curves are given by equations of the form y* = 
ax? + bx? + cx + d where the cubic polynomial on the right-hand side 
has no repeated roots. The conjecture that Frey was referring to was the 
Shimura—Taniyama—Weil conjecture, a statement about elliptic curves with 
rational coefficients that is very difficult to state in simple terms so we will 
make no attempt here. What was stunning to the mathematical community 
was that Frey’s approach was new and totally unexpected. However, there 
were few details in Frey’s paper, so work had to be done to give a rigorous 
proof of his claim. This was done by Ken Ribet, who proved in 1990 that if 
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the Shimura—Taniyama—Weil conjecture was true for so-called semistable 
elliptic curves, then Fermat’s Last Theorem was also true. Then in the 
summer of 1993, Andrew Wiles announced that he had a proof of the 
Shimura—Taniyama-—Weil conjecture for semistable elliptic curves and hence 
a proof of Fermat’s Last Theorem. The proof was long and difficult, so it 
was necessary for mathematicians to check Wiles’ work. Alas a mistake, a 
serious one, was found in subsequent months. Wiles worked for a year on 
the gap in his proof and with the help of Richard Taylor was able to clear 
up all difficulties. Finally, after more than 350 years, the first correct proof 
of Fermat’s Last Theorem was given! 


Exercises 5.6 


1. Prove the statement ‘‘For every integer n, if n° is even, then n is even”’ 
without proving its contrapositive. 


2. Prove that for every integer n, n is even if and only if n‘ is even. 
3. Use induction to give another proof of Theorem 5.6.2. 


4. Prove that for every integer n, if n is divisible by 3, then n’ is divisible 
by 3. 


5. Prove the statement “For every integer n, if n’ is divisible by 3, then n 
is divisible by 3” without proving its contrapositive. 


6. Let k be a positive integer. Prove that for every integer n, if n is divisible 
by k, then n7’ is divisible by k. 


7. Prove Proposition 5.6.4. 
8. Prove S(6) by proving its contrapositive. 
9. Prove Proposition 5.6.6. 
10. Prove Lemma 5.6.7. 
11. Prove Lemma 5.6.8. 
12. Prove Proposition 5.6.9. 
13. Prove that S(k) is false for k = 12, 16, and 18. 


14. (a) Prove that S(k) is false if k = p? where p is a prime. 
(b) Prove that S(k) is false if k = p” where p is a prime and m = 2. 
(c) Prove Proposition 5.6.10. 


15. State and prove a theorem that classifies all k andt for which S(k, t) is true. 


16. Prove that if the equation x* — y* = z* has no solutions in positive 
integers, then Fermat’s Last Theorem is true for n = 4. 


5.6 


17. 


18. 


19. 
20. 
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Prove that if Fermat’s Last Theorem is true forall odd prime exponents 
p and also for n = 4, then it is true for every exponent n. 


Prove that if the equation x” + y" = z” has no solutions in positive 
integers, then it has no solutions in positive rational numbers. 


Verify that 3, 5, 7, and 11 are regular primes. 


Prove that there is no right triangle with integer sides whose area is a 
perfect square. 

Hint: First prove that if such a triangle exists, then there must be such a triangle 
whose sides form a primitive Pythagorean triple. Then use Theorem 5.3.10 and 
Exercise 11 (da) of Section 5.4 together with Fermat’s result that the equation x* — 
y! = 2° has no solutions in positive integers to derive a contradiction. 


INFINITE SETS 


Introduction 


We have studied many different sets in this text. Some of them have a finite 
number of elements; the set {1, 2,... , 10} for example. But most of the 
interesting sets we have looked at have an infinite number of elements, the 
sets Z, Q, and R being the most important examples. In the 19th century, 
mathematicians attempted to introduce rigor into calculus. In doing so, they 
undertook a comprehensive study of the properties of the real numbers. As 
the theory advanced, it soon became clear that infinite sets in general had 
to be studied. But the very idea of a set with an infinite number of elements 
was resisted for a long time, mainly because it seemed to lead to contradic- 
tions. 

If A and B are finite sets, it is usually easy to determine which of the 
sets has the greater number of elements; we just count the number of elements 
in each set. Now suppose that A and B are infinite sets. Is there a sense in 
which one of these sets is ‘‘bigger’’ than the other? Clearly, counting elements 
is not a viable method in this case. However, there is another way to show 
that two finite sets A and B have the same number of elements: find a 
bijective function with domain A and codomain B. It follows from Exercise 
27(a) of Section 3.2 that if such a function exists, then |A| = |B|. By analogy, 
we could say that two infinite sets have the ‘‘same size”’ if there is a bijection 
between them. 

Here are two examples that illustrate the fact that the notion of two 
infinite sets having the same size can run counter to our intuition. 

Let C,; and C, be two concentric circles. We can establish a bijection 
between C, and C, by associating a point p; on C, with a point p, on C; if 
P, and p, lie on the same radial line from the common center of the circles. 
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By our definition, then, we would conclude that C; and C, have the ‘‘same 
size.”’ However, one of the concentric circles will clearly have a smaller 
circumference than the other. 

As another example, consider the set Z of integers and the set E of even 
integers. The function f: Z — E defined by f(n) = 2n is a bijection between 
Z and E. 


Thus we have the seeming contradiction of a set having the same size as 
one of its proper subsets. 

Gradually, as the 19th century came to a close, these examples came 
to be seen as paradoxes rather than actual mathematical contradictions. 
Paradoxes, in this context, are seemingly contradictory statements that are 
actually true. In the last example, if we accept the fact that two sets having 
the same size means that there is a bijection between them, then it is a true 
statement to say that Z and E have the same size. 

It was the mathematician Georg Cantor (1845-1918) who in a series of 
papers put the theory of infinite sets on a firm logical foundation. We will 
discuss many of Cantor’s ideas in this chapter. 


COUNTABLE SETS 


Numerically Equivalent Sets 


Cantor distinguished infinite sets by their ‘‘size’’ in the same way we dis- 
cussed, by labeling sets as having the ‘“‘same size” if there is a bijection 
between them. We will call such sets numerically equivalent. The formal 
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definition is given next. As a result of this definition we will be able to give 
a formal definition of what it means for a set to be finite. 


Definition 6.1.1 Let A and B be sets contained in some universal set 
U. We say that A and B are numerically equivalent if there exists a 
bijection f: A — B. We say that A and B have the same cardinality. 


If A and B are numerically equivalent, we write A ~ B. 

The definition of numerical equivalence actually defines an equivalence 
relation on the power set of U. The proof of this fact is left as an exercise. 
Specifically, we have the following: 


1. For every set A, A =~ A. 
2. For all sets A and B, if A ~ B, then B = A. 
3. For all sets, A, B, and C, ifA ~ Band B=C, thenA=C. 


Definition 6.1.2 Let A be a set contained in a universal set U. We say 
that A is a finite set if A = @ or if A = {1, 2, ... , n} for some positive 
integer n. The integer n is the number of elements of A and is denoted 
by |A|. 

A set that is not finite is called an infinite set. 


Note that if A and B are finite sets, then A ~ B if and only if |A| = |Bl. 
We leave the formal proof as an exercise. 

The cardinality of a set is a measure of the size of a set. So, for example, 
two finite sets have the same cardinality if and only if they have the same 
number of elements. The cardinality or cardinal number of a finite set is the 
number of elements in the set. Every nonnegative integer then is a cardinal 
number. (0 is the cardinality of @.) 


Countable Sets 


The most natural example of an infinite set, the one we first encounter in 
arithmetic, is the set Z* of positive integers. So we will first look at sets that 
are numerically equivalent to Z*. Note that since Z* has no largest element, 
it must be an infinite set. (See Exercise 18 of Section 5.1.) 


Definition 6.1.3 Any set A that is numerically equivalent to Z* is called 
a countably infinite set. A set that is either finite or countably infinite 
is called countable. A set that is not countable is called uncountable. 
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Any two countably infinite sets then have the same cardinality. We denote 
this cardinality by Xo (read ‘“‘aleph zero” or “‘aleph naught’’). Xo is our first 
example of an infinite cardinal number. 


The set Z of all integers is countably infinite. To prove this we must find a 
bijective function f: Z— Z*. One possible way to do this is to map the positive 
integers to the even positive integers and map the nonpositive integers to 
the odd positive integers. The function defined by 


2n, ifn >0 


fin) = | 


—2n +1, ifn <0 


is such a function. We leave as an exercise the fact that f is bijective. 


In light of the previous example and the discussion at the beginning of this 
chapter, it should seem quite plausible that the set E of even integers is 
countably infinite. To prove this fact, it would seem necessary to find a 
bijective function f: E > Z*. But since Z* ~ Z by Example 1, and since 
numerical equivalence is an equivalence relation, it suffices to show that 
there is a bijection function f: Z — E. The function f(n) = 2n is such a 
function. Therefore E is countably infinite. 


The previous examples suggest that any infinite set of integers is countably 
infinite. In fact, any infinite subset of a countably infinite set is countably 
infinite. 


Let A be a countably infinite set and let B be an infinite subset of A. Then 
B is countably infinite. 


PROOF: Since A is countably infinite, there exists a bijection f: Z* — A. To 
prove that B is countably infinite, we will define a bijective function g: 
Z* — B. To do this, we first inductively define a collection of nonempty 
subsets of Z*. 

Let S; = {i € Z* | f(i) € By}. S, is the set of positive integers that get 
mapped to B by the function f. Since f is surjective, S; is an infinite subset 
of Z*. Next we define S, as follows: by the Well-Ordering Principle, S, has 
a smallest element that we will call k,. So we let S, = S, — {k,}. Now suppose 
that we have defined n — 1 nonempty subsets of Z*, S;, S.,..., S,-1. By 
the Well-Ordering Principle, each S; has a smallest element. Call it k;. We 
define S, = S, — {k,,ky,..., k,-1}. Then S, is nonempty since B is an infinite 
set. Let k, be the smallest element of S,. Now we define g: Z* — B by 
g(n) = flkn). 

Note that the sequence of integers k,, k), ... has the property that if 
n <m, then k, < k,,. We now show that g is a bijective function. To see 
that g is injective, let n, m © Z* and suppose that g(n) = g(m). Then 
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f(kn) = f(km). Since f is injective, we have k, = k,,. It now follows from the 
above remark that n = m. Therefore, g is injective. 

To prove that g is surjective, let b © B. We must find n € Z* such that 
g(n) = b. Since f is surjective, there exists t € Z* such that f(t) = b. Note 
that t © S; since f(t) € B. Let m be the number of integers in S, that are 
less than ¢. If there are no such integers, that is, if m = 0, then t = k,. 
Otherwise the integers that are less than ¢ are k,, kn, ..., Km. Since Sy4) = 
S, — {k,, ky, ..., Km}, t must be the smallest element of S,,.; and hence t = 
Km+1. Thus if m = 0 or not, g@m + 1) = f(t) = b. This proves that g is surjective. 

Since g is a bijection, it follows that B is countably infinite. ©» 


Every subset of Z is countable. 


We have seen that a set A is countably infinite if there is a bijection from 
Z* to A. Sometimes it may be difficult or cumbersome to define such a 
function. In our next result, we prove that we only have to find a surjective 
function from Z* to A. 


Let A be an infinite set. Suppose there exists a surjection f: Z* — A. Then 
A is countably infinite. 


PROOF: We will inductively define a bijective function g: Z* — A. 

Let g(1) = f(1). Let m be the smallest positive integer such that f(m) # 
g(1). Let g(2) = f(m). Now suppose that k € Z* and g(1), g(2),..., 
g(k — 1) have been defined. Let n, be the smallest positive integer such 


that f(n,) € A — {g(1), g(2),..., g(k — 1)}. We define g(k) = f(n,). This 
definition gives a function from Z* to A. From the definition of g, g(k) # 
g(i) for any i = 1, 2,..., k — 1. Thus g is injective. 


We now prove that g is surjective. Let a € A. Since f is surjective, a is 
in the image of f. Let t be the smallest positive integer such that f(t) = a. 
Then f(t) ¥ f(i) for any i = 1, 2,...,t — 1. Let r be the number of distinct 
elements from among f(1), f(2), ... , f(t — 1). Then g(1), g(2), ... , g(r) are 
equal to f(1), f(2), ... , f(t — 1) in some order. 

Hence ft is the smallest positive integer such that f(t) © A — {g(1), 
g(2),..., g(r)}. It follows from the definition of g that g(r + 1) = f(t) =a 
proving that g is surjective. 

We have thus constructed a bijection from Z* to A and so A is countably 
infinite. @> 


We next relate the idea of a countable set with the notion of a sequence. 


Definition 6.1.7 Let U be some universal set. A sequence of elements 
of U is a function f: Z* > U. 
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If f: Z* + Uis a sequence and we write a, = f(n) for each n € Z’, then 
the sequence can be expressed in the form {a;, a, a3, ...} or more simply 
{a,};-,. Note that the elements of the sequence are not necessarily distinct. 
In fact, they would all assume the same value if f is a constant function. 

Note however that the image of f is a countable set. That fact follows 
from Theorem 6.1.6. 

Now let A be a countably infinite set and let f: Z* — A be a bijection. 
Let a, = f(n) for each n € Z*. Then the elements of A can be identified 
with the sequence {a), a), a;,...} with no elements repeated; that is, a; 4 a; 
if i  j. 

Conversely, if the elements of a set can be written as a sequence, then 
that set is finite or countably infinite. 


eae a ft 
ceca 


and therefore A is countably infinite. Notice that this gives us an example 
of a countably infinite set that is also a bounded set: every element of A is 
in the closed interval [0, 1]. 


ne 2 The elements of A form a sequence 


Unions of Countable Sets 


We now consider the question of whether there are ‘bigger’ sets than Z 
that are countable. More specifically, are there countable sets of real numbers 
that contain the set Z as a proper subset? For example, our intuition would 
tell us that the set A = Z U {V2} is a countable set. In fact, if S is any finite 
set of real numbers, then the set Z U S is countable. We leave the proof as 
an exercise. (See Exercise 7.) 

Now what about the union of Z with another countably infinite set? Such a 
union will be countably infinite. More generally, we have the following result: 


Let A,, A;,..., A; bea finite collection of countably infinite sets, all contained 
in some universal set U. Then A; U A, U... U A, is a countably infinite set. 


PROOF: Since each A; is countably infinite, there is a bijection f;: Z* > A; 
for each i = 1, 2,...,k. Let A = A,; UA, U...U A,. We will define a 
surjective function f: Z* — A. It will then follow from Theorem 6.1.6 that 
A is countably infinite. 


We let 
tL) =f10) 
f(2) = fd) 
fk) = fi) 


fk +1) = fi2) 
f2k) = fil2) 
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An explicit formula for f can be given as follows. Let n € Z*. Using a 
modification of the Division Algorithm, we can write n = mk + r where m 
is a nonnegative integer and r is an integer such that 1 = r = k. Moreover, 
rand m are unique. Then f(n) = f(mk + r) = f(m + 1). 

Now f is not necessarily injective but it is surjective. To see this, let a © 
A. Then a € A; for some i. Since f; is surjective, there exists n © Z* such 
that f(n) = a. Then f((n — 1)k + i) = fi(n) = a. Therefore f is surjective 
and our proof is complete. ~* 


The Rationals Are Countable 


We now consider the set Q of rational numbers. At first glance it seems to 
be a “much bigger” set than Z. As we will see in Chapter 7, between any 
two real numbers there are infinitely many rational numbers. This means 
that any interval on the real line, no matter how small, will contain infinitely 
many rational numbers. Despite this fact, the rational numbers do form a 
countable set. 


The set Q of rational numbers is countably infinite. 


PROOF: We will show that the set Q* of positive rationals is countably infinite. 
Since it is easy to define a bijection from Q* to Q,, it will follow that 
the set Q of negative rationals is also countably infinite. Since Q = 
Q* UQ U {0}, Theorem 6.1.8 implies that Q is countably infinite. 

Any element of Q* can be written as a/b where a, b € Z*. So we can 
write the elements of Q* in a rectangular array where the elements in a 
given row all have the same denominator; that is, the numbers in row 1 have 
denominator 1, the numbers in row 2 have denominator 2, the numbers in 


row 3 have denominator 3, ... , and so on. It will look like this: 
1 2 3 4 
1 1 1 Vive 
1 2 3 2 
2 2 2 Diets 
1 2 3 4 
3 3 3 Bees 
1 2 3 4 
4 4 4 4.. 


Note that elements of Q* appear more than once in this listing. In fact 
each element appears an infinite number of times. We define f: Z* — Q* 
by going through this array diagonally: 
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2 3 4 
1 ees i. 
bf ea fei 
L 2 So bik 
2 2 2 
ra a, 
3 3 
ae 
4 
That is, we can define: 
=p I=3 IO=f Y=] M9) =5 
1 2 


f)=F  AD=Z fB)=F... 


We will forego giving an explicit formula for f since it is complicated. 
But it is clear from the construction of f that every positive rational is in the 
image of f. It is now a consequence of Theorem 6.1.6 that Q* is countably 
infinite. We can now conclude that Q is a countably infinite set. 


Any subset of Q is countable. 


Cartesian Products of Countable Sets 


Another set that is “bigger” than Z is the Cartesian product Z x Z, the set 
of all ordered pairs of integers. In view of the examples we have done so 
far, it should not be surprising that Z x Z is a countable set. In fact if A 
and B are any countably infinite sets, then the Cartesian product A X B is 
countably infinite. One proof of that fact uses an argument similar to the 
diagonal argument of Theorem 6.1.9. We leave the details of this proof as 
an exercise. (See Exercise 12.) We give another proof next that uses the 
Unique Factorization Theorem for Integers. 


Let A and B be countably infinite sets contained in some universal set U. 
Then A X B is countably infinite. 


PROOF: Since A and B are countably infinite, there exist bijections g: Z* > 
A and h: Z* = B. For each i € Z’, let a; = g(i) and b; = h(i). Since g is 
injective and surjective, every element of A can be written uniquely in the 
form a; for some i € Z*. Similarly, every element of B can be written uniquely 
in the form b; for some j € Z*. 
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Define f: A X B > Z* by f(a;, b;) = 2'3/. We first show that f is injective. 
Suppose that (a;,, b;) and (a, b;) are in A X B and that f(a;, b;) = fla, b)). 
Then 2'3/ = 243. It follows from the Unique Factorization Theorem for 
Integers that i = k andj = /. Hence (a, b;) = (a,, b,) and we can conclude 
that fis injective. 

Clearly f is not surjective because the image of f consists of only those 
positive integers that can be written as a power of 2 times a power of 3. But 
the image of f is an infinite subset of Z, so by Corollary 6.1.5, Im fis countably 
infinite. Therefore, A x B is numerically equivalent to a countably infinite 
set, namely Im f and is thus itself countably infinite. “» 


Note that using the primes 2 and 3 in this proof was not essential. 
Any pair of primes would suffice. Similarly, by using three primes, one 
can show that the Cartesian product of three countably infinite sets is 
countably infinite. In general, we have the following result, whose proof 
is left as an exercise. 


Let A,, A>,..., A, bea finite collection of countably infinite sets, all contained 
in some universal set U. Then the Cartesian product A; X A) X... X A, is 
a countably infinite set. 


So far we have only looked at infinite sets that are countable. There are 
indeed infinite sets that are not countable: the set R of real numbers is the 
outstanding example. In the next section, we will prove that R is not countable 
and see other examples of uncountable sets. 


=’ HISTORICAL COMMENTS: CANTOR AND THE INFINITE 


It might be said that Georg Cantor introduced the infinite to mathematics. 
Certainly, mathematicians had considered infinity previously in many differ- 
ent ways; for example, the ancient Greeks had computed areas by exhausting 
a region with infinitely many simple geometric figures, and by Cantor’s time, 
the notion of limit was being made rigorous. But Cantor treated infinite 
objects as ‘“‘completed”’ (the terminology is due to Gauss), so that their sizes 
could be measured and compared. In fact, as we will see in the next section, 
he concluded that different sizes of infinity existed. 

This point of view was very controversial in the mathematical community. 
Among Cantor’s many critics were Henri Poincaré (1854-1912), who re- 
marked that one should ‘‘never introduce objects that one cannot define 
completely in a finite number of words,’ and the similarly acid Leopold 
Kronecker (1823-1891). Cantor went so far as to blame Kronecker, a profes- 
sor at the University of Berlin, for his own failure to rise above his position 
at the less prestigious University of Halle. 

Sadly, Cantor may have let such feelings of persecution dominate his 
later life. He suffered several break downs and died in a hospital in 1918. 
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His mathematics, however, has stood the test of time, and his place in history 
was elegantly foretold by David Hilbert (1862-1943), who called Cantor’s 
work “‘the most astonishing product of mathematical thought” and predicted 
that ‘“No one shall expel us from the paradise which Cantor has created 
for us.” 


Exercises 6. 


nA F&F W N 


10. 


11. 


. Let Ube some universal set. Prove that numerical equivalence, as defined 


in Definition 6.1.1, defines an equivalence relation on P(U). 


. Prove that if A and B are finite sets, then A ~ B if and only if |A| = |B). 
. Prove that the function defined in Example 1 is bijective. 

. Prove that Z ~ Z* by finding a bijective function g: Z* > Z. 

. Let Z be the set of negative integers. Prove that Z~ ~ Z* by finding 


a bijective function f: Z* — Z. Prove that your function is bi- 
jective. 


. Prove that the following sets S are numerically equivalent to Z by explic- 


itly defining a bijective function f: Z — S. Then prove that your function 
is indeed bijective. 

(a) S = O, the set of odd integers 

(b) S = nZ, the set of multiples of a fixed positive integer n 

(c) S={n € Z|n = 3t + 1 for some t € Z} 

(d) S = {x € Z| x > 100} 

(ec) S= ZU {V2} 

(f) S = {x ER |sinx = 0} 

(g) S = Z x {0} 

(h) S = Z~x {-1, 1} 


. Prove that if S is any finite set of real numbers, then the set Z U S is 


countably infinite. 


. (a) Prove that if A is a finite set and S C A, then S is a finite set. 


(b) Prove that if A and B are finite sets, so are A U Band AN B. 
(c) Prove that if A is an infinite set and B is a finite set, then A — B is 
an infinite set. 


. Let A, B, C, D be subsets of a universal set U. Suppose that A ~ B and 


C = D. Prove that A X C= B X D. 
Compute the values f(10), f(15), and (20) of the function defined in the 
proof of Theorem 6.1.9. 


Prove that each of the following sets is countably infinite. Use the results 
of this section rather than constructing an explicit bijection to Z*, and 
remember to show that the set is not finite. 
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13. 


14. 


15. 


16. 


17. 
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(a) QN (0, 1] 
(b)Q-Z 
(c) QU fe" |x € Z 


(d) The set of ordered pairs (» *) wheren € Z* andk = Oor1 


(e) The set of prime numbers 
(f) The rational points on the unit circle: 


{(x, y) |e + y= 1,x€Q,y © Q} 

Hint: Pythagorean Triples! 

(a) Give another proof of Theorem 6.1.11 using an argument similar to 
the one given in the proof of Theorem 6.1.9. 

(b) Give a proof of Theorem 6.1.12 along the lines of the proof of 
Theorem 6.1.11 given in the text. 

(c) Use induction and Theorem 6.1.11 to give another proof of Theo- 
rem 6.1.12. 


Still another proof of Theorem 6.1.11 goes as follows. Using the notation 
of the proof of Theorem 6.1.11, define fi: A x B — Z* by fi(a, 6) = 
2'1(27 — 1). Verify that f, is a bijection. 


Let A and B be sets contained in a universal set U. Suppose there exists 
an injection f: A — B. Prove that if B is countable, then A is countable. 


Let A and B be sets contained in a universal set U. Suppose there exists 
a surjection f: A — B. Prove that if A is countably infinite, then B 
is countable. 


Prove that Q is countably infinite by finding a surjection f: Z Xx Z>Q 
and applying Exercise 15. 


Prove that the set of all closed intervals [a, b] where a,b € Q isa 
countably infinite set. 


Discussion and Discovery Exercises 


D1.(a) Imagine that you owned a hotel with a countably infinite number 


of rooms. Suppose that on a given day, every room was taken. A 
new guest arrives. Explain how you could provide the new guest 
with a room of her own while still providing the guests already there 
with rooms of their own. 

(b) Then suppose that a countably infinite number of new guests arrive. 
How could the owner find a room for each new guest while again 
making sure that all of the current guests have their own rooms? 
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UNCOUNTABLE SETS, CANTOR’S THEOREM, 
AND THE SCHROEDER-BERNSTEIN IT HEOREM 


In this section we will study infinite sets, such as the real numbers, that are 
uncountable. Before beginning, though, we must recall some simple facts 
about the decimal expansion of real numbers. 

We will assume that any nonnegative real number x can be written in 
decimal form 


a an a3 
X= 4.003... =a +--+ 75+... 
10 10? 10° 
where dp, 4, 4), 43, .. . are nonnegative integers and a,, a), a3,...are between 


0 and 9, and that every such expansion in fact represents a real number. 
Such an expansion is unique except for expansions like 


5 = 5000... = 4999... 


That is, the decimal representation is unique unless the expansion ends in 
an infinite string of 9s. (To see this, notice that if these expansions represent 
different real numbers, there would be a real number between them. What 
could its expansion be?) 

To ensure uniqueness, then, we will adopt the following convention: we 
will not use decimal expansions ending in an infinite string of 9s. 

One more thing: there is nothing special about the decimal expansion. 
We could expand using any positive integer greater than 1, not just using 
10. The most important case is when the base is 2. This is called the binary 
expansion, and it uses only the digits 0 and 1. Of course, the same problem 
arises: the binary representation of a real number is unique only if we agree 
not to use expansions ending in an infinite string of 1s. 


Uncountable Sets 

With all this in hand, we are now able to prove that some very familiar sets 
are uncountable. 

The interval [0, 1) is uncountable. 


PROOF: The proof is by contradiction. We assume that [0, 1) is countable 
and derive a contradiction. If [0, 1) is countable, then there is a bijection f: 
Z* — [0, 1). For each i € Z’, let x; = f(i). Now by the previous remarks, 
each x; has a decimal expansion of the form 


xX; = O.aiah... 


where aj, a)... are integers between 0 and 9. 
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We will derive our contradiction by constructing a real number x in 
[0, 1) that is not equal to any x;. 

Let x = 0.b,b,b; ... where b; = 0 if ai = 1 and b; = 1 if ai # 1. This 
means that x differs from x, in the first decimal place, differs from x, in the 
second decimal place, differs from x; in the third decimal place, and so on. 
Hence x #-x; for all i. In other words, x ¥ f(i) for any i € Z*. But this 
contradicts the fact that f is surjective. Thus [0, 1) cannot be countable and 
therefore must be uncountable. ©» 


The closed interval [0, 1] and the set R of real numbers are uncountable. 


PROOF: If [0, 1] were countable, then by Theorem 6.1.4, any subset of 
[0, 1] would be countable. In particular, this would make [0, 1) countable, 
contradicting Theorem 6.2.1. Therefore, [0, 1] is uncountable. Similarly, the 
set R is uncountable. «> 


Since we now know that there are subsets of R that are uncountable, we 
will consider the question of the numerical equivalence of such sets. For 
example, it is not hard to prove that every closed interval [a, b] is numerically 
equivalent to [0, 1]. It follows that any two closed intervals [a, b] and [c, d] 
are numerically equivalent. (See Exercise 1.) Similarly, any two bounded 
open intervals are numerically equivalent. (See Exercise 2.) 


The function f(x) = tan x is a bijection from the open interval (- z) to 


the real numbers R. Hence any bounded open interval is numerically equiva- 
lent to R. 


Every closed interval [a, b] is numerically equivalent to any open interval 
(c, d). This fact can be proven fairly easily using the Schroeder—Bernstein 
Theorem, which will be proved later in this section. 


Any two sets that are numerically equivalent to R have the same cardinal- 
ity. We denote this cardinality by c, called the power of the continuum. c is 
our second example of an infinite cardinal number. It follows from the 
previous remarks that every open or closed interval of real numbers has 
cardinality ce. 


Cantor’s Theorem 


Since there is a natural ordering of the positive integers, it is possible to 
determine the larger of two finite cardinal numbers. What about the infinite 
cardinals? Since Xo is the cardinality of Z and c is the cardinality of R, it 
makes sense that X, be a “‘smaller’’ cardinal number than ¢ because Z is a 
“smaller” set than R: Z is a subset of R and Z is not numerically equivalent 
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to R. We are led to the following definition of what it means for one set to 
be ‘“‘smaller’ than another. 


Definition 6.2.3 Let A and B be sets contained in some universal set 
U. We say that A < B if there is an injection f: A — B but no bijection 
from A to B. 

We say that A <x Bif A < B or if A is numerically equivalent to B. 


Now if we have two cardinal numbers, say x and y, we will say that x < 
y if x is the cardinality of set X, y is the cardinality of a set Y, and X < Y. 

Similarly, we will say that x x y if x is the cardinality of set X, y is the 
cardinality of a set Y, and X < Y. 

For these definitions to make sense, however, they should not depend on 
the choice of the sets X and Y. In other words, we need the following lemma. 


Let A, B, C, D be sets contained in a universal set U. Suppose that A ~ B, 
C= D. If A < C, then B < D. If A x C, then B X D. 


PROOF: See Exercise 6 for the proof. «= 
So we can now say that Ny < ¢. 


We might ask if there are any infinite cardinal numbers x such that x < 
NX). The answer is no since it can be shown, although it is difficult, that every 
infinite set contains a countably infinite subset. 

Are there other infinite cardinal numbers besides 8,» and c? The answer 
is yes, and in fact there are an infinite number of infinite cardinals. The 
following result, known as Cantor’s Theorem, explains why. 


Cantor’s Theorem Let A be a set contained in a universal set U. Let P(A) 
denote the power set of A. Then A < P(A). 


PROOF: We first need to show that there is an injection f: A — P(A). Such 
an injection must map elements of A to subsets of A. There is a natural way 
to do this. Let a € A and map a to the subset {a}; that is, f(a) = {a}. It is 
easy to see that f is injective: for if f(a) = f(b) for some a, b € A, then 
{a} = {b} and hence a = b. 

Now we will show that there is xo surjective function from A to P(A). 
Suppose, on the contrary, that there is a surjective function g: A — P(A). 
We will derive a contradiction. 

If a € A, then g(a) is a subset of A. Now the element a may or may not 
be an element of this subset; that is, for some a, we may have a € g(a) and 
for some a, we may have a € g(a). 

Let B = {a € A|a € g(a)}. B is the set of elements a of A that get 
mapped to a subset of A that does not contain a. 
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Since g is surjective and B is a subset of A, B must be in the image of 
g. This will be true even if B is the empty set. So there exists an element ay 
in A such that g(a)) = B. Now we consider the question: is aj € B or not? 
Here is where we will derive a contradiction by showing that either possibility 
leads to an absurdity. 

Suppose a, € B. Then, by definition of B, a) € g(a). But g(a) = B, so 
we are led to the contradictory statement that if a, © B, then a ¢ B. 

Now suppose that a ¢ B. Then it follows that ay © g(a) = B. So if 
a) € B, then a © B! There is no way out of this contradiction except to 
conclude that B is not in the image of g, contradicting the fact that g is 
surjective. Therefore, no such surjective function can exist. 

We have thus proved that A < P(A). «= 


The Continuum Hypothesis 


Question: Is there an infinite cardinal x such that X%) < x < e? This problem 
has a long and interesting history. Cantor conjectured that no such infinite 
cardinal x exists. His conjecture has come to be known as the Continuum 
Hypothesis. One consequence of the Continuum Hypothesis is that any 
subset of the real numbers is either countable or is numerically equivalent 
to R. At the International Congress of Mathematicians in Paris in 1900, the 
eminent mathematician David Hilbert (1862-1943) gave an address in which 
he outlined the outstanding mathematical problems that he wished to see 
solved in the 20th century. The first problem on his list was the Contin- 
uum Hypothesis. 

The first real progress on the Continuum Hypothesis was made by Kurt 
Gédel (1906-1978) in the 1930s when he proved that the Continuum Hypoth- 
esis was consistent with the axioms of set theory. (For a brief discussion of 
these axioms, see Section 6.3.) In other words, using the commonly accepted 
axioms on which the theory of sets is based, it is impossible to disprove the 
Continuum Hypothesis. However, in 1963, Paul Cohen (1934— ) proved 
that the negation of the Continuum Hypothesis is also consistent with the 
axioms of set theory. Thus the Continuum Hypothesis is independent of 
the axioms of set theory and can be neither proved nor disproved from 
those axioms! 


The Schroeder-Bernstein Theorem 


Note that < is a relation on P(U), where U is a universal set. But what kind 
of a relation is it? 

According to Exercise 8(b), like the corresponding relation on Z, < is 
reflexive and transitive but not symmetric. It is too much to hope that < be 
antisymmetric: if A x B and B < A, there is no reason that A and B must 
actually be equal. We can prove, though, that A and B must be numeri- 
cally equivalent. 


224 


Theorem 6.2.6 


CHAPTER 6 INFINITE SETS 


However, a little thought will convince you that this last statement is not 
simple to prove. Indeed, if A x B, then we have an injection A — B, whereas 
B XA gives an injection B > A. But these injections are not necessarily 
inverses of each other, so we cannot immediately conclude that either is a 
bijection (that is, a surjection as well). 

This important result is known as the Schroeder—Bernstein Theorem, 
and has surprising and interesting consequences. 


Schroeder-—Bernstein Theorem Let A and B be sets, and suppose that 
Ax Band BXA. ThenA ~ B. 


PROOF: Let f: A — B and g: B — A be injections. We need to construct a 
bijection h: A > B, so leta € A. The key is to trace a back as far as possible 
through f and g. That is, if a = a, € g(B), let b; be the unique element of 
B such that a, = g(b;); if b; € f(A), let a, be the unique element of A such 
that b; = f(a); if a, © g(B), let b, be the unique element of B such that 
a, = g(b2); and so on. We might call this tracing the ancestry of a, and we 
can visualize it as follows: 


A B 


We thus construct a set {a = a,, b,, a2, b2, a3, b3, .. .} of ancestors of a. Now 
this may be a finite set: we may reach an oldest ancestor a; € g(B) or b; € 
f(A). Or the process may not stop, yielding an infinite set of ancestors. So 
let us define the following subsets of A: 


A, = {a € A|ahas an oldest ancestor a; € A}, 
Ag = {a € A | ahas an oldest ancestor b; € B}, 
A. = {a € A|ahas no oldest ancestor}. 


These sets are clearly pairwise disjoint and A, U Ag U Aj = A; that is, 
every element of A must satisfy exactly one of the three conditions. 


Corollary 6.2.7 


Corollary 6.2.8 
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Finally, notice that we could perform the same analysis on B, and arrive 
at analogous subsets Bz, By, and B. of B. In fact, f maps A, bijectively to 
B, and A.. to B.; similarly, g maps Bz; bijectively to Ag. (See the exercises. ) 
Thus we may construct the bijection h as follows: 


f(a), ifa€ AyU Az 
b, where a = g(b),ifa EG Ag. 


h(a) = | 

One consequence of the Schroeder—Bernstein Theorem and Cantor’s 
Theorem is that there must be an infinite number of infinite cardinals. For 
if A < P(A) for any set A, then P(A) < P(P(A)) < P(P(P(A))) <.... Each 
one of these sets has a different cardinality, giving us an infinite collection 
of cardinal numbers. In Exercise D2, you are asked to justify this statement. 

The Schroeder—Bernstein Theorem can often be used to give fairly easy 
proofs of the numerical equivalence of two sets when explicitly constructing 
a bijection is difficult. 


(0, 1) ~ [0, 1]. 


PROOF: By Exercise 9, since (0, 1) is a subset of [0, 1], we know that 
(0, 1) < [0, 1]. On the other hand, by Exercise 1(b), [0, 1] is numerically 
equivalent to [1/4, 3/4], which is a subset of (0, 1). Therefore, [0,1] < (0, 1), and 
the result follows immediately from the Schroeder—Bernstein Theorem. “? 


A similar argument can be used to prove that any two intervals of real 
numbers are numerically equivalent. See Exercise 14. 

An interesting consequence of Theorem 6.2.5 is the fact that since 
Z < P(Z), P(Z) must be an uncountable set. With the Schroeder—Bernstein 
Theorem, we can now prove that the cardinality of the power set of Z is in 
fact c. For convenience, we will actually look at the power set of Z*; in the 
exercises, you will be asked to show that the power set of any countable set 
is numerically equivalent to P(Z). 


P(Z*) ~R. 


PROOF: We will show that P(Z*) ~ [0, 1); this is sufficient since we know 
that R and [0, 1) are numerically equivalent. 

First, let us define an injection f: P(Z*) — [0, 1). For a subset A of Z*, 
let f(A) = 0.a;aqa; ... (a decimal expansion) where 


i ifiEGA 
1, ifiG A. 


Since no f(A) ends with an infinite string of 9’s, f(A) is always an element 
of [0, 1). Moreover, if f(A) = 0.a;qma;... = 0.b,b.b; ... = f(B) for subsets 
A and B of Z*, these decimal expansions are unique, and so a; = bj, for all 
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i € Z'. That is, i € A if and only ifi © B or A = B. We conclude that fis 
an injection, and so P(Z*) < [0, 1). 

To define an injection g: [0, 1) ~ P(Z*), we will use the binary expansion 
of elements of [0, 1): let 


g(0.a,a,a;...) = {i © Z* | a; = O}. 


Recall that the binary expansion is unique because we do not allow an infinite 
string of 1s. Therefore, there is no ambiguity in the definition of this function. 

If A = g(O.a,qa;...) = g(0.b,b,b; ...), then a; = b; = 0 if i © A, and 
otherwise a; = b; = 1. Thus, 0.a;qa;... = 0.b,b.b; ... proving that g is an 
injection. Therefore, [0, 1) x P(Z*). 


A closing comment may help you to understand this last proof. In con- 
structing f, we did not really need to use the decimal expansion. The ternary, 
or base 3, expansion would have sufficed, but the binary would not! However, 
the binary expansion is necessary to define g, whereas the decimal would 
not work. Can you see why these statements are true? You are asked to 
carefully explain them in the exercises. 


«= MATHEMATICAL PERSPECTIVE: THE PERILS OF INTUITION 


Cantor’s Theorem and its consequences are certainly surprising and to some 
extent counterintuitive. This is undoubtedly why the mathematical commu- 
nity found them so difficult to accept. However, there are many examples 
of startling results that defy our intuition. 

For example, mathematicians in the 19th century attempted to give a 
rigorous definition of the ‘“‘obvious”’ notion of a continuous curve, such as 


the following: 


Camille Jordan (1838-1922) gave a perfectly reasonable definition: a 
curve is a set of points (f(t), g(t)), where fand g are continuous functions 
of t € [0, 1]. Amazingly, Giuseppe Peano (1858-1932) found a function that 
meets this definition yet whose image is a square, the region [0, 1] x [0, 1]: 


1 


0 1 


This “space-filling”? curve is not at all what Jordan had in mind. Peano’s 
function is not a bijection; Eugen Netto (1846-1919) showed that a continu- 
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ous bijection from an interval to a square is impossible. Of course, the fact 
that a bijection exists is a consequence of the fact that the interval and the 
square have cardinality ec. 

But perhaps the most unbelievable result came in 1924. The Banach- 
Tarski Theorem states that a ball can be broken up into finitely many pieces, 
and then reassembled to form a ball twice as big as the original! This remark- 
able discovery was made in the field of measure theory, the attempt to 
generalize the notion of the length of an interval (or the volume of a ball) 
to arbitrary sets, and clearly revealed the need to understand the foundations 
of mathematical logic as well as the axioms of set theory. 


Exercises 6.2 


1. (a) Prove that every closed interval [a, b] is numerically equivalent to 
(0, 1]. 
(b) Prove that any two closed intervals [a, b] and [c, d] are numeri- 
cally equivalent. 


2. Prove that any two open intervals (a, b) and (c, d) are numerically equiv- 
alent. 


3. Let f: [0, 1] — (0, 1) be the function defined by f(0) = s.A() = 


ona for any n € Z* and f(x) = x otherwise. Prove that f is a bijection. 
4. Let [a, b] be a closed interval and (c, d) an open interval. Prove that 
[a, b] and (c, d) are numerically equivalent. 


5. (a) Prove that the function f(x) = — defines a bijection from R*, the 


set of positive real numbers, to the open interval (0, 1). 

(b) Prove that R* is numerically equivalent to any bounded open or 
closed interval. 

(c) Prove that any unbounded interval of the form (a, ©) is numerically 
equivalent to R. 

(d) Prove that any unbounded interval of the form (— ©, a) is numerically 
equivalent to R. 


6. Prove Lemma 6.2.4. 


7. Determine the cardinality of each of the following sets. Prove your 
answer. 
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12. 


13. 


14. 


15. 


16. 


17. 


(c) The set of irrational numbers 


(d) RXR 
(e) {(% p) ER X R|x ER, p isa prime} 
f 

() {fer x R,pisa prime} 


(g) F(Z*), the set of all functions f: Z* > Z* 
(h) the set of real numbers whose decimal expansions do not contain 
the digit 5 


. Let A, B, Cbe sets contained in a universal nonempty set U. The symbols 


< and < define relations on P(U). 
(a) Prove that < is transitive but not reflexive or symmetric. 
(b) Prove that < is reflexive and transitive but not symmetric. 


. Let A and B be sets such that A C B. Prove that A < B. 
10. 


Let A, B, C, and D be sets contained in a universal set U such that 
A < B and C X D. Prove that A X CX BX D. 


. Let A, B be sets contained in a universal set U. Suppose that A ~ B. 


Prove that P(A) ~ P(B). 
Hint: Exercises 21 and 22 of Section 3.2 will be helpful for this problem. 


Let A, B be sets contained in a universal set U such that A C B. Suppose 
that A is countable and B is uncountable. Prove that B — A is un- 
countable. 


Prove or disprove: the relation < is antisymmetric on P(U), where U is 
a universal set. 


Using the Schroeder—Bernstein Theorem, prove that any two intervals 

of real numbers are numerically equivalent. 

Hint: The possibilities are: (a, b), [a, b], (a, b], [a, b), (a, ©), [a, ©), (—%, b), 

(—~, b], and (—®, ~), 

Fill in the details of the proof of the Schroeder—Bernstein Theorem 

by showing: 

(a) f maps A, bijectively to B,. 

(b) f maps A.. bijectively to B.. 

(c) g maps Bs; bijectively to Ag. 

(d) fh is a bijection. 

Consider the proof of Corollary 6.2.8. 

(a) Explain why the proof that f is an injection will fail if the binary 
expansion is used, and why it would be valid for any n > 2 if the 
base 1 expansion is used. 

(b) Explain why the proof that g is an injection will fail if the base n 
expansion (n > 2) is used. 


Prove that there is no universal set of all sets. 
Hint: Let U be the set of all sets, and use Cantor’s Theorem to derive a contradiction. 
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Discussion and Discovery Exercises 


D1. Do you think that the set of all permutations of Z* is countable? Since 
this is a subset of the set of all functions from Z* to Z’, its cardinality 
can be no greater than your answer to Exercise 7(g). Also, it’s obviously 
infinite. (Proof?) For guidance, you might want to consider the finite 
case: the set of permutations of {1, 2,... , m} has cardinality n!, and 
n <n! as long as n is at least 3. 


D2. Prove the statement given after the proof of the Schroeder—Bern- 
stein Theorem: that there are an infinite number of infinite cardi- 
nals, by showing that, for an infinite set A, the sets P(A), P(P(A)), 
P(P(P(A))), ... all have different cardinality. Indicate where you 
actually use the Schroeder—Bernstein Theorem. 


COLLECTIONS OF SETS 


Russell’s Paradox 


In Section 2.3 we discussed arbitrary collections of sets and considered their 
union and intersection. Such a collection of sets or set of sets is usually 
described by an indexing set and this indexing set can be infinite. For example, 
for each positive integer i, let A; = (-i, i) = {x € R| -i < x < i} asin 
Example 3 of Section 2.3. Then the set S = {A;, Az, A3,...} is a collection 
of open intervals whose indexing set is Z*. Using the terminology of this 
chapter, we would say that S is a countably infinite collection of open intervals 
since the indexing set is countably infinite. Similarly, if r is a real number, 
let A, = (r, ©) = {x © R| x > r} as in Example 5 of Section 2.3. Here the 
indexing set is uncountable, so we say that the collection {A, | r € R} is an 
uncountable collection of sets. So there are many ways of constructing sets 
by taking sets of sets, sets of sets of sets, and so on. 

However, there are many pitfalls and surprises inherent in any casual 
approach to building sets. For example, we know (Exercise 17 of the previous 
section) that there is no object as enormous as “‘the set of all sets.” 

The mathematician and philosopher Bertrand Russell (1872-1970) dis- 
covered that there are severe logical problems with just assuming that any 
combination of sets results in another set, and his discovery shocked the 
mathematical world. He asked the following question: can a set be an element 
of itself? Certainly, most sets are not: {1} € {1}, since that set contains only 
one element, namely 1. On the other hand, if there were such a thing as the 
set of all sets, then as a set, it would have to be an element of itself. 

So Russell constructed .°7 = {A | A € A}, the set of all sets that are not 
elements of themselves. Then {1} € .:¥ but the set of all sets is not an element 
of ..% So far, so good. But then Russell asked: what about .°7itself? If .~ © 
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, then by its very definition, .°7 € .°7! And if.7 € .<% then this must mean 
that «7 E «A 

This argument has come to be known as Russell’s Paradox. It is not a 
semantic trick; it means that not every collection can, logically, be considered 
a set. In other words, the study of sets should properly begin (just like our 
study of the integers in Chapter 5 did) with axioms that specify how sets 
can be constructed. Many different lists of such axioms have been studied, 
and although we will not examine them in detail here (you may some day 
in a course in mathematical logic), we will make an assumption that is 
common to most: if all the sets we consider are subsets of one larger set—a 
universal set—then the operations of union, intersection, and Cartesian 
product generate sets. 

With this in mind, we can now begin to extend some of the earlier results 
of this chapter to arbitrary collections of sets. As before, in many examples, 
a universal set such as C, R, Q, or Z is clear, but its existence is always 
assumed, even if not explicitly stated. 


Countable Unions of Countable Sets 


We have seen that the finite union of countably infinite sets is countable. 
What about infinite unions? 


For every i € Z*, let A; be a countably infinite set. Then U A; is also 
ieZ* 
countably infinite. 
PROOF: For every i € Z*, we have a bijection f;: Z* — A;. Write f(j) = ai, 
for every j € Z*,so that U A; = {ai | i, | © Z*}. We may arrange the elements 
iez* 

of the union in a table and then enumerate them by tracing the diagonals, 
much as we did in proving that Q is countably infinite. 

Explicitly, let us define the diagonal height of a‘ to be the integer i + j. 
Then there are m — 1 elements in the union that have diagonal height m. 


Diagonal height m =5 


Thus, if we enumerate the elements of the union by their diagonal height 
(eliminating repeated elements, of course) we will have an enumeration of 
the entire union: 


Wad? gal we. gayle al! ade 8d ed 
{a}, 41,4}, a}, 43,43, a}, a3, a3, a4, ss Be 


EXAMPLE | 
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Because U A; can be written asa sequence, it follows that U A;is countably 
ieZ” ieZ” 
infinite. |= 


Notice that the uncountable union of countably infinite sets may well be 
uncountable. In fact, this may be true of the uncountable union of finite 


sets: Uf} =R. 
rER 


Another way to combine a finite number of sets, say n, is to form the 
Cartesian product, the set of all n-tuples with coordinates chosen from the 
respective sets. To do this for infinitely many sets, we must extend the idea 
of an n-tuple. Let’s begin with the easiest case, where we have countably 
many sets. 


Definition 6.3.2 Let {A; | i © Z*} be a countably infinite collection of 

sets. The Cartesian product of the collection, denoted Il A,, is the set 
iez* 

of all sequences of the form (a, a, a3,...,4;, ...) such that a; € A; for 

alli € Z*. 


Let A = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9}, the set of base 10 digits. Then Il A 

iez* 
(here we’ve chosen all of the sets to be the same) is the set of all sequences 
(a,, @, 43, ...) of the ten digits. 


This last example is reminiscent of the proof that [0, 1) is an uncountable 
set. In fact, we can mimic that proof to determine the size of many 
Cartesian products. 


Let {A; | i € Z*} be a countably infinite collection of sets, and suppose that 
each A; contains at least two elements. Then II A; is uncountable. 
iEZ” 
PROOF: Suppose, to the contrary, that Il A, is countable. Then there is a 
ieZ” 
bijection f: Z* > |] Aj, and we can write f(i) = .°%, for i € Z*. Now the 
iez* 

elements of the Cartesian product are sequences, so we can write .% = 
(ai, a5,a5,...), where ai € Aj. That is, 


A, = (al, a},a},...) 
oy = (at, a3, 03,...) 
ot, = (aj, a3, a3,...) 
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Now since each A; contains at least 2 elements, we can choose ele- 
ments b; € A; such that b; # ai, for every i © Z*. But then 4 = (b,, 
bz, b3,...) € Il Aj, yet B # %, for any i © Z*, a contradiction. «> 


iez* 


We have until now required our index set in forming a Cartesian product 
to be Z*. The reason is not hard to see. We were trying to extend n-tuples 
(a,, a, 43,...,4,) to infinite lists; naturally, we looked at sequences (a), a, 
a3,...) indexed by all positive integers. 

But what if the index set to our collection is uncountable? Can we still 
form a sensible product? The key lies in remembering that although our 
notation for a sequence, (a@;, a, 43, ...), resembles that for an n-tuple, the 
actual definition of a sequence is that it is a function with domain the 
index set Z*. This definition can easily be extended to any index set—finite, 
countably infinite, or uncountable. 


Definition 6.3.4 Let {A; | i € I} be a collection of sets. The Cartesian 
product of the collection, denoted I A;, is the set of all functions 
i€f 


fils U A, such that f(i) € A; for alli € 1. 


Before concluding, we must say a word about the significance of these 
last two definitions. It may seem intuitively clear that if A; ~ ©, for alli € 
I, then certainly Il A; must also be nonempty. However, this innocuous 

ie] 
statement turns out to be very deep. 

To be specific, one of the most common foundations for set theory is a 
system known as the Zermelo—Fraenkel Axioms. These axioms assure, among 
other things, the existence (as sets!) of the empty set, the power set of a set, 
the union of a set of sets, and the like. The consequences of these axioms are 
collectively referred to as Zermelo—Fraenkel Set Theory, or simply ZF 
Much of mathematics, and almost all of what we have done, can be carried 
out in ZF. 

However, the above statement cannot; that is, the statement 


If 1 A @ and A, # @, for alli € J, then [|] A; 4 @ 


i€] 


known as the Axiom of Choice, cannot be proven or disproven from the 
Zermelo—Fraenkel Axioms (Gédel proved that the Axiom of Choice is con- 
sistent with the Zermelo—Fraenkel Axioms and Cohen proved that the nega- 
tion of the Axiom of Choice is also consistent with the Zermelo—Fraenkel 
Axioms.) If the Axiom of Choice.is added to the list and assumed in addition 
to the other axioms, the resulting system is referred to as ZFC. 
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Many interesting and useful results can be proven in ZFC using the 
Axiom of Choice, and it is sometimes surprising when it is required. For 
example, it is needed to prove that the cardinal numbers are linearly ordered: 
if A and B are sets, then either A < B or BX A. We used it, in fact, in our 
proof of Theorem 6.3.1. (See the exercises.) 

It is safe to say that the vast majority of mathematicians today work in 
ZFC, and it has been our context from the beginning. 


“HISTORICAL COMMENTS: MATHEMATICS IS LOGIC 


Bertrand Russell’s discovery of the paradox mentioned at the beginning of 
this section sprung from his ambitious goal to describe a complete logical 
foundation for mathematics. This enormous effort bore fruit: Princi pia Math- 
ematica, three volumes written with Alfred North Whitehead (1861-1947). 
The Principia develops logic in a formal way (using notation similar to ours 
from Chapter 1 but much more strictly) and set theory, but with a specific 
caveat: a collection cannot be a member of itself. Thus, Russell and 
Whitehead explicitly avoid the paradox. 

Interestingly, another mathematician was not so clearsighted. Gottlob 
Frege (1848-1925) spent much of his career on a project similar to Russell’s: 
his two-volume Fundamental Laws of Mathematics. Shortly before the second 
volume was to go to press, he received a letter from Russell containing an 
exposition of the paradox. Frege’s dismay is best described by his own words: 
“A scientist can hardly meet with anything more undesirable than to have 
the foundation give way just as the work is finished.” 


Exercises 6.3 


1. Let {A; | i € J} be a collection of subsets of a set A and {B; | j € J} be 
a collection of subsets of a set B. Let f: A > B. 


(a) Prove that (Y A) = U f(A). 
(b) Prove that r(y z,) = Uf). 
(c) Prove that (0 A) C [1 f(A). 
(d) Prove that r(Q z,) = (1 f'(B). 


2. Suppose we define an order on R X R as follows: 


(a, b)<(c,d)@a<c or (@=candb<d). 
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This is called the lexicographic order. We define (a, b) = (c, d) in the 

obvious way. 

(a) Let (a, b), (c, d) © R X R. Prove that if (a4, b) S (c, d) and 
(c, d) S (a, b), then (a, b) = (c, da). 

(b) Let (a, b), (c, d), (e, f) © R X R. Prove that if (a, b) S (c, d) and 
(c, d) S (e, f), then (a, b) = (e, f). 

(c) Let (a, b) € R X R and let Aq» = (x, y) ER X R| @ y) > 
(a, b)}. Prove that the collection {Aq | (a, b) € R X R} is a de- 
creasing chain. (See the definition of decreasing chain in Sec- 
tion 2.3.) 


3. Let {A;| i € J} be a collection of sets and suppose that U A; is countably 


infinite. Must at least one of the Ajs be countably infinite? Prove or dis- 
prove. 


4. Let A = {0, 1}. Prove that the set II A is numerically equivalent to R. 
iez* 
5. Let A = {0, 1}. Prove that the set [] A is numerically equivalent to P(R), 
rEeR 


the power set of R. 
6. Let J be a set. Then [] Z is the set of all functions f: J > Z. We define 


iel 


the direct sum over J of Z, denoted ® Z, to be the set of all functions f: 
i€ 


I > Z such that f(i) = 0, for all but a finite number of elements i € /. 
(a) Prove that [] Z = ® Z © 1 isa finite set. 
ie! i= 


(b) Prove that if J is countable, so is ® Z. 


‘el 


7. Give a different proof of Theorem 6.3.1 by constructing a bijection 
FTOS BE 30 Ap 
iez* 
Hint: Think of U A; as a matrix with infinitely many rows and columns. 
iez* 


Discussion and Discovery Exercises 


D1. Precisely explain how the Axiom of Choice is used in the proof of 
Theorem 6.3.1. 


D2. Let {A;|i € Z*}be a countably infinite collection of sets. What conditions 
on the sets A; will guarantee that [] A; is countable? Try to be as 
iez* 
general as possible. 


7.1 


‘HE REAL AND 
COmPLExX NUMBERS 


FIELDS 


In Chapter 5 we studied the set of integers from the point of view of their 
axioms and the results that can be derived from these axioms. In this chapter 
we consider two other important sets: the set R of real numbers, which should 
be very familiar to you, and the set C of complex numbers, which you may 
find less familiar. 

Both the real and complex numbers satisfy similar axioms, many of 
them the same as those satisfied by the integers, but with some important 
differences. Any set that satisfies these axioms is called a field. In this section, 
we discuss fields in general as well as the notion of an ordered field. 


Definition 7.1.1 A field F is a nonempty set with two binary operations. 
The first is called addition and is denoted by +; the second is called 
multiplication and is denoted by - . These binary operations satisfy the 
following axioms: 


1. Addition is associative: (a + b) +c =a + (b+) for all a, b, 
cEF. 

2. Addition is commutative:a + b=b+aforalla,bEF. 

3. F has an identity with respect to addition: there exists an element 
O in F such thata + 0 = 0 +a =a for alla € F. 0 is called the 
zero element (or just simply zero) or additive identity of F. 
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4. Every element of F has an inverse with respect to addition: for 
each a in F, there is an element —a € F such that a + (—a) = 
~a+a=0. —a is called the additive inverse of a or negative a. 

5. Multiplication is associative: (a: b)-c = a: (b-c) for all a, b, 

cer. 

. Multiplication is commutative: a:b = b-a for alla, b € F. 

7. F has an identity with respect to multiplication that is distinct 
from the additive identity: there is an element 1 in F such that 
14O0Oanda:1=1-a=aforalla € F.1 iscalled the multiplicative 
identity (or just the identity) of F and is often just called one. 

8. Every nonzero element of F has an inverse with respect to multipli- 
cation: if a € F and a ¥ 0, then there exists an element a! € F 
such that a: a! = a!-a = 1. a‘ is called the multiplicative in- 
verse or simply the inverse of a. 

9. The following distributive law holds: a:-(b + c) =a:b+a:‘c 
for alla, b,c EF. 


ON 


Note that all of the axioms except 8 are also satisfied by the set Z 
of integers. But since only 1 and —1 have inverses in Z with respect to 
multiplication, Z is not a field. 


The set R of real numbers is a field with respect to the usual operations of 
addition and multiplication in R. The additive identity is the real number 
zero, which we write as 0, and the multiplicative identity is the real number 1. 

If a © R, a 4 0, then we will often write the multiplicative inverse of a 
as 1/a, in keeping with standard notation. Similarly, if a, b © R, b 0, then 
we write ab™! as a/b. 


The set Q = {¢ abEZ,bF o| of rational numbers is a field with respect 


to the usual way of adding and multiplying fractions. Note that if a/b € Q, 
a ~ 0, then the multiplicative inverse of a/b is b/a. 


Let F be a field. Note that we use the same notation for the additive 
identity of F, namely 0, that we use for the real number zero. The reason is 
simply because the real number 0 is the identity for addition in R and it is 
convenient to keep the same symbol. It should be clear from the context 
whether we are referring to the real number 0 or to the additive identity of F. 

Similarly, we use the symbol 1 to mean the real number one as well as 
the multiplicative identity of F. 


Proposition 7.1.2 


Proposition 7.1.3 
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Elementary Properties of Fields 


There are many more examples of fields. Since our focus in this chapter is 
on the real and complex numbers, we will restrict our attention to properties 
of fields that are most relevant to these examples. Many of these are quite 
familiar to you as properties of the real numbers, but they can be proved for 
arbitrary fields using only the axioms and results derived from those axioms. 

The first result collects statements identical to those in Propositions 5.1.1 
and 5.1.2. We state them without proof, because they are proved in exactly 
the same way as in Chapter 5, using only the axioms shared by Z and any 
field; namely, Axioms 1-7 and 9. 

First, some comments on notation: for simplicity and convenience, we 
will write ab instead of a- b for multiplication in a field. Also, we will use 
the familiar exponent notation: if a © F and n is a positive integer, then a” 
will mean the product of a with itself m times and a™ will mean the product 
of a“! with itself n times. 


Let F be a field and let a, b, cE F. 


1. a0 = 0a = 0. 

2. (—a)b = a(—b) = —(ab). 
3. —(-a) =a. 

4, (—a)(—b) = ab. 

5. a(b — c) = ab — ac. 

6. (-l)a = —a. 

7. (-1)(-1) = 1. 


The following proposition gives some properties of addition and multipli- 
cation in a field which are quite familiar as properties of the real numbers. 


Let F be a field and let a, b, c € F. 


1. Ifa+b=a+c,thenb=c. 
2. If ab = ac and a # 0, then b = c. 
3. If a # 0, then (a!) = a. 


PROOF: The proof is left as an exercise. “* 


Ordered Fields 


Recall that in Chapter 5, Axioms 9 and 10 for the integers gave two 
properties of the set Z* of positive integers. These properties are also satisfied 
by the set R* of positive real numbers. Guided by this example, we single 
out fields that have similar special subsets. 
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Definition 7.1.4 Let F be a field. We say that F is an ordered field if 
there exists a subset P of F satisfying the following two properties: 


1. P is closed with respect to addition and multiplication; that is, if 
x,y © P, thenx + y © Pand xy &€ P. 

2. For every element x in F, exactly one of the following statements 
is true:x € P,-x€ P,orx =0. 


If F is an ordered field with respect to the subset P, then P is called 
the set of positive elements of F. 


The set R of real numbers is an ordered field with P = R*. 
Q is an ordered field with P = Q’, the set of positive rationals. 


Recall that in Chapter 5, we used Axioms 9 and 10 for Z to introduce 
the notion of inequality in Z. We can do the same thing for ordered fields. 


Definition 7.1.5 Let F be an ordered field with respect to the subset 
P.If x, y © F, we say x < y or y > x (x is less than y or y is greater 
than x) ify —x EP. 


As was the case with integers, inequalities in an ordered field satisfy 
some basic properties. The following proposition is analogous to Propositions 
5.1.3 and 5.1.5 and the proofs are the same. 


Let F be an ordered field with respect to the subset P and let a, b, c € F. 


. Exactly one of the following holds: a < b, b < a, ora = b. 
. Ifa > 0, then —a < 0 and if a < 0, then —a > 0. 

. Ifa > 0 and b > 0, then a + b > 0 and ab > 0. 

. Ifa > 0 and b < 0, then ab < 0. 

. Ifa < 0 and b <0, then ab > 0. 

. Ifa<bandb<c, thena <c. 

._Ifa<b, thena+c<bt«. 

. Ifa <b andc > 0 then ac < be. 

. Ifa < bandc < 0, then ac > be. 

_Ifa€ FaF0,thena € P. 


SOMDANIADMA WNP 
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In Z, the set Z* satisfies the Well-Ordering Principle; every nonempty 
subset of Z* has a smallest element. A similar result is not true for R* or 
Q*. In fact there is no smallest element of R*, for if x € R*, then 3x < x, 
proving that no element of R* can be the smallest. 


Proposition 7.1.7 


Corollary 7.1.8 


Theorem 7.1.9 


7.1 FIELDS 239 


This last statement is true in general: in any ordered field F the subset 
P has no smallest element. (See the exercises.) 

The next result gives some additional properties of the subset P of an 
ordered field F. 


Let F be an ordered field with respect to the subset P. Then: 


11€EP. 
2. Ifx € P, thenx'€P. 
3. P is an infinite set. 


PROOF: 1. Suppose that 1 ¢ P Then-1 € Pand since P is closed under mul- 
tiplication, (-1)(-1) € P But (-1)(-1) = 1 by part 7 of Proposition 7.1.2. So 
we are led to the contradiction that if 1 € P then 1 € P. Therefore, 1 € P. 

2. The proof is left as an exercise. 

3. Since 1 € P and P is closed under addition, the elements 1 + 1, 1 + 
1+ 1,... are all in P Now suppose that two sums of this form are equal, 
sayl+1+...+1(atimes)=1+1+...+ 1(m times) withn > m. 
Then, by part 1 of Proposition 7.1.3, we would have 1 +1+...+1(n-m 
times) = 0. But this implies that 0 € F a contradiction. Therefore the 
sums 1+1+...+ 1 (n times), n € Z’, are all distinct, giving us infinitely 
many elements of P. ©) 


Every ordered field is infinite. 


Finite Fields 


In Section 5.5 we discussed the set Z, of congruence classes mod n where 
n is a fixed positive integer greater than 1. Theorems 5.5.5 and 5.5.6 show 
that Z,, satisfies Axioms 1-7 and 9 of a field. The additive identity is the 
congruence class [0] and the multiplicative identity is the class [1]. 

However, for some n, Axiom 8 will not be satisfied. For example, in Z, 
there is a nonzero element that does not have a multiplicative inverse. (Which 
one?) So Z, is not a field. However, it is easy to see that Z, and Z; are fields. 
So exactly which Z,, are fields and which are not? 


Z,, is a field if and only if n is a prime number. 


PROOF: First suppose that 1 is a prime number. As just pointed out, any Z,, 
satisfies all of the axioms of a field except possibly Axiom 8. So it suffices 
to show that any nonzero element of Z,, has a multiplicative inverse when 
n is prime. 

Let [a] € Z,, [a] ¥ [0]. Since a and 0 are not in the same congruence 
class, it follows that a is not congruent to 0 mod n. Hence n does not divide 
a and since n is prime, a and n must be relatively prime. Then by Theorem 
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5.3.8, there exist integers x and y such that ax + ny = 1. Hence [a][x] + 
(n][y] = [ax] + [ny] = [ax + ny] = [1]. But [n] = [0] since n = O(mod n). 
Thus [a][x] = [1], so [x] is the multiplicative inverse of [a]. Therefore, if n 
is a prime number, Z, is a field. 

Now we want to prove the converse: if Z, is a field, then n is a prime 
number. We will prove the contrapositive of the converse: if 7 is not a prime 
number, then Z,, is not a field. 

Let n be a composite number. Then there exist integers a and b such 
that n = ab where 1 <a<nand1<b <n. We get 


[a][b] = [ab] = [n] = [0]. 


Now note that [a] ¥ [0] since 1 < a <n and [b] # [0] since 1 <b <n. 
We claim that [a] does not have a multiplicative inverse. Suppose, on the 
contrary, that [a] does have a multiplicative inverse. Then there exists an 
integer x such that [x][a] = [1]. Multiplying both sides of this equation by 
[b], we get 


(Lx][@)[5] = [1J[] = [5]. 


Since multiplication in Z,, is associative, we have 


[x]([a][b]) = [b}. 


But [a][b] = [0] from above. Hence [b] = [x][0] = [x0] = [0]. This is a 
contradiction, since [b] ¥ [0]. Therefore, [a] does not have a multiplicative 
inverse, proving that Z, is not a field. #9 


Notice that by Corollary 7.1.8, if p is a prime, then the field Z, is not an 
ordered field. 


“e MATHEMATICAL PERSPECTIVE: OTHER FIELDS 


We have seen relatively few examples of fields in this section: Q, R, C, and 
the finite fields Z,. Are there others? In trying to answer a question like 
this, mathematicians often take a very commonsense approach: use the exam- 
ples we know to construct new and different ones. 

Suppose we start with Q and a real number not in Q, say V2. The field 
R contains both Q and 72. of course, but is there a smallest field that does 
as well? Essentially, we can just build that field F If QC FCR, ee = F, 
and F is a field, then since F is closed, F must also contain, for example, 
2V2,1 - V2,%, (V2)3, and all elements of this type. However, since re 
4/2 and (V2) = 2, it is not hard to see that F = {r + sV2|r, s € Q}. We 
denote this field Q(V2). Now obviously Q(V2) # Q, but it is not so clear 
that Q(V2) # R. 

To prove this last statement, we can adapt our proof that V2 is irrational 
and show that V3 ¢ Q(V2). Suppose to the contrary that V3 = r + 
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sV2, where r, 5 € Q. Then 3 = 7? + 2s? + 2rsV2, and this would mean that 
V2 is rational. (You should check the special cases r = 0 and s = 0.) 

Q(Vv2) is called an algebraic number field, and is clearly one of a plethora 
of other fields: we could just as easily construct Q(V3), Q(W4), and infinitely 
many others. The simple representation we found, however, is due to the 
fact that V2 is a solution of the equation x? — 2 = 0, in which the coefficients 
are all rational numbers. No such polynomial equation can be found for 7, 
and so Q(z), the smallest field of real numbers containing Q and 7, has no 
analogous representation. This very deep fact—that 7 is a transcendental 
number—was first proved by C. L. F. Lindemann (1852-1939). It is an 
indication of the rich variety of infinite fields. 

The idea of adding solutions of equations to a field led Evariste Galois 
(1811-1832) to construct new finite fields from Z,. The difference was that 
Galois added solutions @ of equations f(x) = [0], where f(x) is a polynomial 
with coefficients in Z,. If f(x) is irreducible (that is, does not factor), he 
showed that Z,(q) is a finite field, and that |Z,(a)| is a power of p. Later, 
E. H. Moore (1862-1932) showed that any finite field can be constructed in 
this way. 

In short, mathematicians today have a very clear picture of the structure 
of finite fields, whereas the nature of infinite fields, in particular algebraic 
number fields, remains an active area of research. 


Exercises 7.1 


1. Prove Proposition 7.1.3. 


2. Let F be a field and let a, b € F. 
(a) Suppose that ab = 0. Prove that a = 0 or b = 0. 
(b) Suppose that a? = b*. Prove that a = +b. 


3. Let F be a field. Leta € Fra #0,andn € Z. 
(a) Prove that a” # 0. 
(b) Prove that a” = (a")"1. 


4. Let F be a field and let a, b € F, with a # 0. Prove that the equation 
ax = b has a unique solution x in F. 


5. Let F be a field and let a, b € F. 
(a) Prove that (aby = ab’. 
(b) Prove that (ab)" = a"b" for every positive integer n. 
(c) Prove that ifa, b 40, then (ab)~" = a-"b™ for every positive integer n. 


6. Let F be a field and let a € F, a ¥ 0. Let n and m be positive integers. 
(a) Prove that a"a”" = a"*™. 
(b) Prove that a"a~" = a". 
(c) Prove that a"a™" =a", 
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7. 


11. 


12. 


13. 


14. 
15. 


16. 


17. 


18. 


19. 


(a) Prove that between any two rational numbers there is another ratio- 
nal number; that is, if a, b € Q and a < 5, then there exists z € Q 
such thata <z <b. 

(b) Prove that between any two rational numbers there are infinitely 
many rational numbers. 


. Does the set of irrational numbers form a field? 
. Prove that if x is an irrational number, then x7! is irrational. 


10. 


Let F be an ordered field with respect to the subset P. 

(a) Prove part 2 of Proposition 7.1.7; that is, if x € P, then x7! € P. 
(b) Prove that if x, y € Pandx < y,thenx! > yl. 

(c) Prove that if x, y € F then x? + y? = xy and x? + y? = —xy. 


Let F be an ordered field. Let a, b, c, d € F. Prove or disprove: if a < 
b and c < d, then ac < bd. 


Let F be an ordered field with respect to the subset P. Let x, y € P. 
(a) Prove that if x < y, then x? < y’, 

(b) Prove that if x? < y’, then x < y. 

(c) Prove that if x < y, then for any positive integer n, x" < y”. 


Let F be an ordered field and let a € F. Prove that there exists b € F 
such that b > a. 


Prove that in any ordered field F, the subset P has no smallest element. 


Prove that Q* is the only subset of Q that makes Q an ordered field; 
that is, if Q is an ordered field with respect to a subset P, then P = Q*. 


Let F be an ordered field with respect to the subset P. Prove that —1 is 
not a square in F: there does not exist an element x in F such that x* = —1. 


Let F be a field with the following two properties: 

(1) —1 is not a square in F. 

(2) the sum of any two nonsquares of F is a nonsquare of F. 
Let P be the set of nonzero squares of F. 


P={x € F|x = y’ forsome y € F, y # 0}. 


Let N be the set of nonsquares of F. 

(a) Prove that if x € P, then —x EN. 

(b) Prove that if x € N, then —x € P. 

(c) Prove that F is an ordered field with respect to the subset P. 


Do not use Theorem 7.1.9 to do this problem. 

(a) Prove that Z, is a field. 

(b) Prove that Z, is not a field. 

(c) Prove that Z is not a field. 

(d) Let F = {[0], [2], [4]} C Z.. Prove that F is a field. 


(a) Prove that the equation [2]x = [3] has no solutions in Z,. 
(b) Prove that the equation [3]x = [3] has three solutions in Ze. 


7.2 
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20. Prove that Q(V2) = {r+ sV2 | r, s © Q} is a field. (You may assume 
that addition and multiplication in Q(Vv2) are associative and commuta- 
tive since Q(V2) C R.) 


Discussion and Discovery Exercises 


D1. Suppose that F is an ordered field with respect to the subset P. By 
Exercise 14, we know that P has no smallest element. Use this fact to 
give a different proof of Corollary 7.1.8, that F is infinite. 


D2. Q is a subset of R and is also a field. We would say that Q is a subfield 
of R, since we are using the same addition and multiplication. Are there 
any subfields of Q? Are there any subfields of Z,? 


THE REAL NumBERS 


Bounded Sets 


We saw in Section 7.1 that both R and Q are ordered fields. It is possible 

to construct the real numbers from the rational numbers, but the construction 

is long and fairly complicated so we will not attempt it here. However, one 

property that distinguishes the real numbers from the rationals is an impor- 

tant one that we will take as an axiom, the Least Upper Bound Axiom. 
We first need some definitions. 


Definition 7.2.1 Let S be a nonempty set of real numbers. 


1. We say that S is bounded above if there exists a real number x 
such that a = x for all a € S. The number x is called an upper bound 
for the set S. 

2. We say that S is bounded below if there exists a real number y 
such that a = y for all a € S. The number y is called a lower bound for 
the set S. 

3. If S is bounded above and bounded below, we say that S is a 
bounded set. 


The closed interval [0, 1] is a bounded set. Any number greater than or 
equal to 1 is an upper bound for S and any number less than or equal to 0 
is a lower bound. 
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The open interval (0, 1) is a bounded set. As with Example 1, any number 
greater than or equal to 1 is an upper bound for S and any number less than 
or equal to 0 is a lower bound. 


The set Z* of positive integers is not bounded above. It is bounded below 
and any number less than or equal to 1 is a lower bound. 


The interval (—, 0] is bounded above but not bounded below. 


Least Upper and Greatest Lower Bounds 


Notice that an upper or lower bound for a set S may or may not be in S. 
But in both Examples 1 and 2, the number 1 is clearly an important upper 
bound: it is the smallest of all. (We will prove this shortly.) Similarly, 0 is 
the /argest lower bound in both examples. The existence of these special 
bounds in every case is the essence of the axiom we mentioned previously. 


Definition 7.2.2 


1. Let S be a set of real numbers that is bounded above. We say 
that x is the least upper bound for S if x is an upper bound for S and if 
x <= z whenever z is an upper bound for S. 

2. Let S be a set of real numbers that is bounded below. We say 
that y is the greatest lower bound for S if y is a lower bound for S and 
if y = z whenever z is a lower bound for S. 


The least upper bound of [0, 1] is 1 and the greatest lower bound is 0. 


We claim that the least upper bound of (0, 1) is 1. Clearly 1 is an upper 

bound for (0, 1). Now suppose 0 < z < 1. There exists a real number y 

between z and 1; namely y = (z + 1)/2. Since y € (0, 1), it follows that z 

is not an upper bound for (0, 1). This shows that no real number less than 

1 is an upper bound for (0, 1). Therefore 1 is the least upper bound of (0, 1). 
Similarly, 0 is the greatest lower bound of (0, 1). 


The set Z* does not have a least upper bound since it is not bounded above, 
and 1 is the greatest lower bound. 


The interval (—, 0] has 0 as its least upper bound and does not have a 
greatest lower bound. 
We can now state our final axiom for the set of real numbers. 


Least Upper Bound Axiom: Any nonempty subset of R that is bounded 
above has a least upper bound. 


Theorem 7.2.3 


Corollary 7.2.4 


Lemma 7.2.5: 
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It seems reasonable to expect that any nonempty subset of R that is 
bounded below has a greatest lower bound. This is true but we need not 
take it as an axiom, for it can be proved from the Least Upper Bound Axiom. 
(See Exercise 2.) 

The least upper bound and greatest lower bound of a set, if they exist, 
are unique. (See Exercise 3.) We will denote the least upper bound of a set 
S by lub S and the greatest lower bound by gilb S. 


The Archimedean Principle 


An important consequence of the Least Upper Bound Axiom is the following 
theorem, known as the Archimedean Principle. 


Let a and b be positive real numbers. Then there exists a positive integer n 
such that na > b. 


PROOF: The proof is by contradiction. Suppose that na = b for all positive 
integers n. Let S = {na | n € Z*}. Clearly S is a nonempty set, and by our 
assumption b is an upper bound for S. By the Least Upper Bound Axiom, 
S has a least upper bound. Call it z. 

Since a > 0, z -— a < z. Thus z — a is not an upper bound for S. 
Consequently, some element of S must be greater than z — a; that is, there 
exists x © S such that x > z — a. Since x € S, x = ma for some positive 
integer m. Then ma > z — a and it follows that z < ma + a = (m + l)a. 
But (m + 1)a is in S, giving us a contradiction since z is an upper bound 
for S. The theorem now follows. ©» 


If b is any positive real number, then there exists a positive integer n such 
that 1/n < b. 


PROOF: Applying the Archimedean Principle with a = 1 and b replaced by 
1/b, there exists a positive integer n such that n > 1/b. It now follows from 
Exercise 10(b) of Section 7.1 that 1/n < b. © 


Another important consequence of the Archimedean Principle is the fact 
that between any two real numbers there is a rational number. To prove 
this, we need to prove a lemma first. 


Let x © R. Then there exists an integer m such thatm —- 15x <m. 


PROOF: Suppose first that x > 0. By the Archimedean Principle, there exists 
a positive integer n such that n > x. By the Well-Ordering Principle, there 
must be a smallest such integer; call it m. Then m > x, and since m — 1 < 
mm-—1sx. 

If x = 0, the result is obvious by letting m = 1. 
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Suppose finally that x < 0. Then —x > 0, so by the Archimedean Principle 
again, there exists a positive integer m) such that ny > —x. Then x + my > 
0, so by our first case, there exists an integer my such that m) — 1 Sx + nN 
<m. Letm =m — ny). Thenm-1sx<m.e 


Between any two real numbers there is a rational number. 


PROOF: Let x, y © R such that x < y. Then y — x > 0, so by the Archimedean 
Principle, there exists a positive integer n such that n > 1/(y — x). It follows 
that ny — nx > 1 or nx + 1 < ny. By the previous lemma, there exists 
m © Z such that m — 1 S nx < m. We get m = nx + 1 < ny and hence 
nx <m < ny. It now follows that the rational number m/n is between x 
and y. 3? 


Incompleteness of Q 


We mentioned earlier in this section that the Least Upper Bound Axiom 
distinguishes the real numbers from the rational numbers. To justify that 
statement, we need to demonstrate that Q does not satisfy the Least Upper 
Bound Axiom—that there is a nonempty subset of Q that is bounded above 
but does not have a least upper bound in Q. 

Let S = {x € Q|x > O and x? < 2}. Clearly, S ¥ © and S is bounded 
above since 2 is an upper bound for S. (Why?) Suppose that S has a least 
upper bound in Q. Call it z. We will show that z* = 2. This will be a 
contradiction since it was proved in Section 1.2 that V2 is not a rational 
number. 

Assume that z? < 2. Then z < V2 by Exercise 12(b) of Section 7.1. By 
Theorem 7.2.6, there is a rational number r such that z <r< V2. Hence 
z° <r <2, implying that r € S. This is a contradiction since z is an upper 
bound for S. 

Suppose, on the other hand, that z* > 2. Then z > A/D: so there exists 
a rational number r such that z > r > V2. We get z*° > r? > 2. Thus if 
w € S, w? < 2 <r’, and therefore w < r. Thus r is an upper bound for S, 
a contradiction since r < z and z is the /east upper bound for S. 

Therefore, we must have z? = 2, and, as was pointed out previously, this 
is a contradiction since V2 is irrational. We conclude that S does not have 
a least upper bound in Q. 

In this argument, we assumed that the integer 2 has a square root in R. 
In fact all positive real numbers have a square root in R, and we conclude 
this section with a proof. 


If x € R, x > 0, then there exists y € R, y > 0, such that y’ = x. 


PROOF: Let S = {t€ R|t >Oand t? < x}. We first show that S is a nonempty 
set. Let t = x/(x + 1). ThenO <t<1landt<xso0<?t?<t<_x. Thus 
tES,soS# ©. 
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We now show that S is bounded above. Specifically, we will show that 
x + 1 is an upper bound for S. Suppose there exists t € S such that x + 
1 <t Then (x + 1)? < ? < x. By Exercise 12(b) of Section 7.1, we get that 
x + 1 <x, a contradiction. Therefore, t = x + 1 for allt € S. Hence x + 1 
is an upper bound for S. 

Now we can apply the Least Upper Bound Axiom to S to conclude that 
S has a least upper bound. Call it y. Clearly y > 0. We will prove that y? = 
x by showing that if y? < x or if y? > x, we get a contradiction. 


— 2 
Assume that y’ < x. Then - ma > 0. By Corollary 7.2.4, there is a 
eae 1ox-y 
positive integer such that < . Hence we have 


2y+1 


=yt+ *y +1) 
<y' + (x—y’) 
=X. 
Thus y + . € S, a contradiction to the fact that y is an upper bound for S. 


yiax 


Now suppose that y? > x. Then > 0. By Corollary 7.2.4, there is 


an 
a positive integer m such that + < oe Therefore, 


iy 2 1 
Ute) ane 


m m 
2y 
>y-= 
y m 
ye) 


Se 


2 
Now if w € S, then Ww <x < (> = 1) from which it follows that w < 


y — I/m. (See Exercise 12(b) of Section 7.1.) Hence y — 1/m is an upper 
bound for S, contradicting the fact that y is the /east upper bound of S. We 
can now conclude that y* = x. @? 


If x is a positive real number, then there is only one positive real number 
y such that y’ = x. (See Exercise 14.) We use the usual symbol Vx for y. 
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HISTORICAL COMMENTS: THE LOGICAL FOUNDATION OF R 


The Intermediate Value Theorem, which we stated in Chapter 3, is an 
important theorem of calculus, and it can be made obvious by drawing 
a picture: if f is a continuous function on the closed interval [a, b], and 
f(a) < 0 < f(b), then there is a c € (a, b) such that f(c) = 0; that is, the 
graph of f must cross the x-axis between a and b. 


We did not prove the theorem in Chapter 3, in part because it involves 
the nature of R: the real numbers must be represented by a continuous line. 
Mathematicians in the 19th century realized that to rigorously prove the 
theorem (and many others from calculus), a firm, logical basis for R was 
needed, and several attempts were made to give one. 

At the time, the rational numbers were thought to be completely under- 
stood, or at least natural enough to need no formal definition. So the problem 
was how to define the irrational numbers in such a way as to catch the 
continuity of the correspondence between R and the line. One approach 
was taken by J. W. R. Dedekind (1831-1916), who, remarkably, took his 
cue from ideas that can be found in Euclid’s Elements. 

Dedekind’s insight was that each point divides the line into two sets: 
those to the left and those to the right. He then starts with the rational 
numbers, and identifies real numbers a with ordered pairs of sets of rationals: 
those less than or equal to a and those greater. For example, 0 is identified 
with (Ay, By), where Ay = {r € Q |r < 0} and B, = {fr € Q| r > O}. More 
importantly, 79: an irrational number, is identified with (Avz, Bvz), where 
Ava = {rE Q|r<O0orr? = 2} and By = {fr € Q|r > 0 and r? > 2}. 

Thus R becomes a set of ordered pairs of sets of rational numbers. (The 
ordered pairs have become known as Dedekind cuts.) On this complicated 
set, Dedekind defined operations (+, -, —, +) and order (<, >, =), and 
then proved many of the expected properties, such as the transitivity of >. 
He expressed the continuity of the line by proving that if R is divided into 
two nonempty sets A and B with every element of A less than every element 
of B, then there is exactly one real number a = (A, B). Among other things, 
the Intermediate Value Theorem becomes provable in this logical system. 

Needless to say, some mathematicians did not find this approach entirely 
satisfactory. However, it soon became clear that any successful design would 
have to be quite sophisticated. For example, Georg Cantor (1845-1918) 
defined an irrational number as a sequence of rational numbers 7, ,73,... 
that eventually cluster: that is, for any « > 0, |r, — 7,| < ¢ for n and m 
sufficiently large. (Notice that the number is not the “‘limit”’ of the sequence; 
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Cantor was defining the number, so he identified it with the sequence itself.) 
Cantor then faced the same difficulties as Dedekind: he defined order and 
operations, and proved all the expected theorems. His real numbers were 
also infinite sets, in his case sequences of rationals. 

One final remark: it is fascinating to realize that the historical develop- 
ment of the logical basis for the real numbers came in “‘reverse.”’ The simplest 
reals are the integers, followed by the rationals, and finally the irrationals. 
Yet the first commonly accepted axiom system for Z was given by Guiseppe 
Peano (1858-1932), after Dedekind’s definition of irrationals! 


Exercises 7.2 


1. Find the least upper bound and greatest lower bound (if they exist) for 
the following sets. (No proofs necessary.) 
(a) S=[- 1, ©) 
(b) S= {ke EQ] x? <5} 
(c) S={r EQ|x < 4} 


(d) s={i-1 neZ| 
n 


(e) s= {1+ neZ| 
(f) S=Z 


2. Let S be a nonempty subset of R that is bounded below. Prove that S$ 
has a greatest lower bound. 


3. Prove that the least upper bound of a nonempty subset S of R, if it 
exists, is unique. : 


4. Let a, b € R such that a < b. Let S = (a, Db). 
(a) Prove that lub S = b. 
(b) Prove that glb S = a. 


5. Let S CR, S # &. Suppose that there exist an upper bound x for S such 
that x © S. Prove that x = lub S. 


6. Let S CR, S # ©. Suppose that there exists a lower bound y for S such 
that y € S. Prove that y = glb S. 


7. Let a, b € R such that a < b. Let S = [a, b]. 
(a) Prove that lub S = b. 
(b) Prove that glb S = a. 


8. Let S and T be nonempty bounded subsets of R such that S C T. 
(a) Prove that lub S = lub 7. 
(b) Prove that glb S = glb T. 


9. Let S be a nonempty set of real numbers. Complete the following state- 
ments: 
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(a) S is not bounded above if... . 
(b) S is not bounded below if... . 
(c) S is not bounded if... . 


10. Let S be a nonempty set of real numbers that is bounded above. Let 
y = lub S. Prove that for every positive real number ¢, there is a real 
number z € S such that z > y — «. 


11. Let S be a nonempty set of real numbers that is bounded below. Let 
y = glb S. Prove that for every positive real number ¢, there is a real 
number z € S such thatz<y+e. 


12. Without using the Least Upper Bound Axiom, prove that Q?* satisfies 


the Archimedean Principle; that is, if x, y € Q*, then there exists a 
positive integer ” such that nx > y. 


13. (a) Write the negation of the Least Upper Bound Axiom. 
(b) Write the negation of the Archimedean Principle. 
(c) Write the negation of Theorem 7.2.6. 

14. Theorem 7.2.7 shows the existence, for each positive real number x, of 
a positive real number y such that y’ = x. Prove that y is unique: if there 
exists a real number z such that z* = x, then z = y or z = —y. 


15. Prove that if x, y € R such that 0 < x < y, then Vi < Vy. 


16. Let x, y © R and suppose that x < y + e for all positive real numbers 
e. Prove that x = y. 


17. Prove that between any two real numbers there are infinitely many 
rational numbers. 


18. Prove that between any two real numbers there is an irrational number. 


Discussion and Discovery Exercises 


You may remember that the rational numbers can be characterized as those 
real numbers whose decimal expansion contains a sequence of digits that 
repeats infinitely. For example, 


= = 3.142857142857. 


(Here, the bar means that the sequence will repeat forever.) 
Conversely, the irrational numbers are those real numbers with no such 
repeating pattern. For example, 


V2 = 1.414213562... 
m = 3.141592653.... 


7.3 
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D1. Does this description make you think that it might be harder to prove 
that a number is irrational than rational? Explain. 


D2. Use this description to argue that between every two real numbers there 
are both a rational number and an irrational number. Criticize the 
logical validity of your argument. 


THE CompLex NUMBERS 


In Chapter 8 we will be studying solutions of polynomial equations. Finding 
such solutions can be difficult and some equations have no solutions in a 
given field. The polynomial equation x’ + 1 = 0, for example, has no solutions 
in the field of real numbers. To find a solution we need to find a set containing 
the reals that also contains an element whose square is —1. Such a set is 
the complex numbers and remarkably, as we will see in Chapter 8, the 
Fundamental Theorem of Algebra tells us that this set is ‘‘rich’”” enough to 
contain the solutions of all polynomial equations with real coefficients. 

We begin by defining the complex numbers and proving that they form 
a field. 


Definition 7.3.1 The set C of complex numbers is the set of all ordered 
pairs (a, b) of real numbers together with two binary operations of 
addition and multiplication defined as follows: 


(a,b) + (cg d) =(at+ceb+ a) 
(a, b) + (c, d) = (ac — bd, ad + bc) 


The definition of multiplication seems unusual but, as we will see, it gives 
us an element whose square is —1. 

These two binary operations make C a field. Of course, all of the axioms 
must be verified, but these are routine and some are left as exercises. (See 
Exercise 1.) The additive identity is (0, 0) and the additive inverse of (a, b) 
is (—a, —b). The multiplicative identity is (1, 0) and multiplication is easily 
seen to be commutative. It is easy to show that if a # 0 or b # 0, then the 
multiplicative inverse of (a, b) is 


ish, ee 
Peo a Spy 


So we have the following result. 
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C is a field with respect to the binary operations defined in Definition 7.3.1. 


Some comments are in order before we proceed. First, by the nature of 
our definition, two complex numbers (a, b) and (c, d) are equal if and only 
if a = cand b = d. Another important fact to note is that C is not an ordered 
field since —1 is a square in C. (See Exercise 16 of Section 7.1.) Finally, it 
is common to identify the real number a with the point (a, 0) in C. Because 
these points form a subset of C that is closed under addition and multiplica- 
tion (check!), we can consider the field R as a subfield of C; that is, a subset 
of C which is itself a field with respect to the binary operations of addition 
and multiplication in C. 

Now suppose we consider the point (0, 1), which we denote i. Then i? = 
(0, 1)(0, 1) = (—1, 0), and since we identify (—1, 0) with the real number 
—1, we have the important property that i? = —1. As discussed previously, 
this fact distinguishes the complex numbers as a set containing the real 
numbers and also containing a solution to the equation x’ + 1 = 0. 

If (a, b) € C, (a, b) = (a, 0) + (0, b) = (a, 0) + (6, 0)(0, 1) = a + Bi. 
From now on, we will use the expression a + bi for the complex number 
(a, b). This notation makes computations with complex numbers easier to 
perform. Addition and multiplication with complex numbers can be done 
in the usual way that we treat algebraic expressions of real numbers, with 
the added rule that i? = —1. That is, 


(a + bi)(c + di) = ac + bci + adi + bdi? 
= (ac — bd) + (ad + be)i. 


If z =a-+t bi € C, then a is called the real part of z, denoted by Re z, 
and b is called the imaginary part of z, denoted by Im z. So two complex 
numbers are equal if and only if their real and imaginary parts are equal. 

We must not lose sight of the fact that a complex number can be repre- 
sented as a point in a plane, which we call the complex plane. We will see 
later in this section that important properties of complex numbers can be 
derived by writing them in polar coordinates. 


Conjugation and Absolute Value 


Definition 7.3.3 If z= a+ bi € C, then a — bi is called the complex 
conjugate of z and is denoted by z. The absolute value of z is 
Va? + b? and is denoted by |z]. 


Geometrically, the absolute value of z = a + bi is the length of the line 
segment from the origin to the point (a, b). 
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<> 
a=Rez 


The following lemma lists some important properties of the complex 
conjugate. The proof is left as an exercise. 


Lemma 7.3.4 Let z, w © C. Then 


EXAMPLE | 


Zw : 
Zz = Iz? E R, and therefore is a nonnegative real number. 
zt+z=2RezER. 
zi=~ itz £0. 

Iz| 


aaa a a 


Part 5 of Lemma 7.3.4 gives a straightforward way to compute the multipli- 
cative inverse of a complex number. 


ee ee ne, 
_ j)yi == Hj 
CO Ne aan ges ga 


Solutions of Equations 


As we have seen, the polynomial equation x’ + 1 = 0 has a solution in the 
complex numbers. In fact, it has two solutions: i and —i. As we noted at the 
beginning, the field C of complex numbers contains the solutions of every 
polynomial equation p(x) = 0 with real coefficients. The fact that such a 
polynomial equation has a solution in C is the content of the Fundamental 
Theorem of Algebra, which we will discuss in more detail in Chapter 8. 
Along the way, however, we will see special cases of this theorem. 

The first and easiest special case is the equation ax + b = 0 where a, 
b © R, a # 0. The unique solution is x = —b/a. 

The general quadratic equation ax’ + bx + c = O witha, b,c ER 
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and a # 0 can be solved using the well-known quadratic formula, giving 
the solution: 


ee =p Mb = Age 
2a ‘ 


If b? — 4ac = 0, there is one real solution, and if b? — 4ac > 0, there are 
two real solutions. If, however, b? — 4ac < 0, the quadratic formula gives 
two nonreal solutions in C. 

The general cubic equation ax? + bx’ + cx + d = 0 also has a general 
formula for its solution, but it is considerably more complicated than the 
quadratic formula and will not be given here. We will discuss the cubic 
formula and some special cubic equations in Chapter 8, however. 

Notice that the quadratic formula implies that if the complex number z 
is a solution of ax? + bx + c = 0, then its complex conjugate is also a 
solution. A more general result is the following: 


Let p(x) = a,x" + a,x"! +... + ax + ay be a polynomial of degree n, 
n = 1, with real coefficients. If z © C is a solution of p(x) = 0, then its 
complex conjugate is also a solution. 


PROOF: First note that if r is a real number, then 7 = r and that for any z 
in C, z* = z* for all positive integers k. (See Exercise 4.) Thus if p(z) = 0, then 


DP(Z) = anZ" + AQyyZ™ +... + az + a 
=G,2 +442" +...+%74+% 
=G,2°+ G42"! +...4+G7+% 
=a,2"+a,.2°'+...+ 2+ m% 
=a,e" age Fes Fe Pe 


Let p(x) = x° + x? — x + 2. By trial and error, one can see that p(—2) = 
0 and that p(x) = (x + 2)(x? — x + 1). Using the quadratic formula, we see 
that the solutions of x? — x + 1 = 0 are 1/2 + V3/2i and 1/2 — V3/2 i. 
These two solutions together with —2 give all of the solutions of the equation 
x3 +x? —x +2 =0. This fact is not obvious but will follow from results in 
Chapter 8. 


In Chapter 8, we will also discuss ways of finding integer solutions of 
polynomial equations. 


Polar Form 


Let z = x + yi be a nonzero complex number. As a point in the complex 
plane, z can be represented in polar coordinates. Let r = |z| and @ be the 


EXAMPLE 3 


Theorem 7.3.6 
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angle that the line segment from (0, 0) to (x, y) makes with the positive x- 
axis. Then x = r cos 6, y = r sin 6, and z = r(cos 6 + isin 6). This is called 
the polar form of z, and 6 is called the argument of z, denoted by arg z. 

Notice that there is a subtlety here. There are many different choices for 
6: any of the values 6, 6 + 27, 0 + 47, ... could also be used. We will call 
any of them arg z. 

Another inconvenience is that in the case z = 0, we can write z = 
|z|(cos 6 + isin 6), for any angle 6. For this reason, we do not define arg 0. 


z=r(cos@ + i sin@) 


A convention: in writing a complex number in polar form, it is customary 
to write the i before sin 6, whereas in standard form, we write a + bi with 
i written last. 


We will write z = —2 + 2V3i in polar form. First we see that |z| = 4, so 
that z = 4(— 1/2 + V3/2i). Hence cos 6 = —1/2 and sin 6 = V3/2, so 6 = 


2 We thus have z = 4(cos a + isin ="). 


The following result shows how to compute the product of complex 
numbers when expressed in polar form. 


Let z, w € C, where z = r(cos 6 + isin 0) and w = s(cos ¢ + isin ¢). Then 
zw = rs(cos(@ + ¢) + isin(@+ ¢)). 


PROOF: Using the definition of multiplication in C and some well-known 
trigonometric formulas, we have 


zw = rs cos 0cos ¢ + irs cos Osin ¢ + irs sin Ocos d+ i*rs sin Osin d 
=rs(cos @cos ¢ — sin @sin d) + irs(cos 6 sin ¢ + sin 6 cos d) 
= rs(cos(6 + @) + isin(@+ ¢). == 


The following corollary, known as DeMoivre’s Theorem, gives a conve- 
nient way of computing positive integer powers of a complex number. 
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DeMoivre’s Theorem Let z = r(cos 6 + i sin 6) © C. Then for any n € 
Z*, z" = r'(cos né + isin n6). 


PROOF: The proof is by induction on n. The result obviously holds for n = 
1. Suppose it is true for a positive integer k. Then z‘ = r*(cos k@ + i sin 
k6), and so 


zktl = zkz 
= r‘r(cos(kO + 0) + isin(ké + 6)) 
= r*l(cos(k + 1)6 + isin(k + 1)6). 


Therefore, the formula holds for k + 1, and so for all positive integers n by 
induction. 


We compute z5 for z = —2 + 2V3i. As we saw in Example 3, in polar form, 


z= 4 (cos2F + isin%2), 


So by DeMoivre’s Theorem, 


107 . 107 
5 = 4s : 
rP=4 (cos 3 + isin =) 


= 210 (cos 2 + isin 2) 


3 
(4-8) 
= —2°(1 + V3). 


Complex Roots 


An important application of DeMoivre’s Theorem is the computation of nth 
roots of complex numbers. 


Definition 7.3.8 Let z € C andn € Z". An nth root of z is a complex 
number w such that w” = z. 


Let 04 z =r(cos 6+ isin 6) € C,n € Z*. Then z has exactly n nth roots, 
given by 


- w( (24224) zs, (24 #4)) 
W.=TFr cos ars + isin s 


fork = 0,1,2,...,n— 1. 


EXAMPLE 5 


Theorem 7.3.10 
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PROOF: By DeMoivre’s Theorem, it follows easily that wz = z for all k, 
where 0 = k =n — 1. It is also easy to see that the w,s are distinct. So z 
has at least n nth roots. 

Let w = s(cos @ + isin ¢) be an nth root of z. Then 


w" = s"(cosngd + isinnd) 
=Z 
=r(cos 6+ isin @) 


so s" = rand therefore s = r'". 

Also, cos n@ = cos @ and sin nd = sin 0, so nd = 6 + 2t7 for some t € 
Z. Then ¢ = (6 + 2t7)/n, and by the Division Algorithm (Theorem 5.3.1), 
there exist integers g and k such that t = nq + k where 0 =k <n. It follows 
that @ = (0 + 2km)/n + 2qz, and so w = w,. We can now conclude that the 
w,S give all of the nth roots of z. «@ 


We compute the third (cube) roots of z = —2 + 2V3i. We know from 
Example 3 that z = 4(cos 2 + isin 22), So the third roots of z are given by 


w, = 4° (cos (2 a =a os isin 22 + 2k) ) 


for k = 0, 1, 2. We get: 


2a . 20 
= 4iB Ber ae 
Wo=4 (cos 9 isin =), 


w, = 418 (cos = + isin 2) 
147n . . 147 
w, = 418 (cos 2 + isin 4) 


An important special case of the last theorem is the computation of the 
nth roots of z = 1, called the nth roots of unity: w is an nth root of unity if 
w= 1. 


For each n € Z’, there are exactly n nth roots of unity, given by 


Q2ak 27k 
@, = cos —— + i sin — 
n 


forO =k sn -l. 


PROOF: We leave it as an exercise. “©? 
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EXAMPLE 6 The third or cube roots of 1 are 


@ = 1, 
@, = cos — + isin 2h, 
: 3 3 5 a 
hyped eect a UNS, 
@) 3 Ll 4 0) pae 
@) 
Mo 
M2 


EXAMPLE 7 The fourth roots of 1 are 1, i, —1, —i. 


Corollary 7.3.11 Let n © Z*. Let w = cos = + i sin 22, Then the nth roots of unity are 1, 
w, w,@,..., 07). 


PROOF: We leave it as an exercise. © * 


The complex number » of Corollary 7.3.11 is called a primitive nth root 
of unity, since every other nth root of unity is a power of w. 


“© HISTORICAL COMMENTS: EXTENDING THE COMPLEX NUMBERS 


Our definition of the complex numbers, as a set of ordered pairs of real 
numbers, was given by the Irish mathematician W. R. Hamilton (1805-1865). 
Of course, mathematicians had long been using them, taking i as V-~1, an 
“imaginary” number. But Hamilton’s definition took the mystery away by 
giving a logical basis for C. Again, it is surprising that this milestone preceded 
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any logical foundation for R, just as Dedekind’s work predated Peano’s. 
(See the Historical Comments in Section 7.2.) 

In true mathematical spirit, Hamilton wondered if it were possible to 
continue extending the number system. Specifically, he searched for a way 
to define multiplication on triples (a, b, c) (or sums a + bi + cj for some /) 
of real numbers that would result in a field. It turned out to be a harder 
proposition than he imagined. He spent 15 years on the problem, to the 
point where his daughters would tease him at the dinner table: ‘“‘Daddy, can 
you multiply triples?’”’ There is no record of his reaction. As the story goes, 
he was walking with his wife one day across the Brougham Bridge in Dublin. 
Apparently his mind was elsewhere because he had a flash of insight: don’t 
multiply triples, but rather quadruples. Then he stopped and carved a formula 
into the bridge: 


2 =p? =k? = ijk =-1. 


Hamilton had discovered what he came to call the quaternions H. To be 
specific, H = {a + bi + cj + dk |a, b, c, d € R}, where 2 = j? = k? = —1, 
and interestingly, multiplication is not commutative: ij = —ji = k, ik = 
—ki = —j, and jk = —kj = i. Hamilton was able to prove that, except for 
commutativity, all the axioms of a field hold for H. He had thus extended 
C, at the price of only one of the axioms. 

Hamilton was so entranced by his discovery that he spent the rest of his 
life studying this new set of numbers and espousing their use. Although he 
may have understandably overestimated their importance, they have proven 
extraordinarily useful in many areas, notably in modern physics with its 
conception of space time as a four-dimensional universe. 

The reason for Hamilton’s struggle became clear only later, when F. G. 
Frobenius (1849-1917) proved that the problem was not a lack of creativity 
on Hamilton’s part. Frobenius showed that the only way to extend the 
multiplication on R to n-tuples (a,, a, ..., a,) of reals and preserve the 
field axioms is through Hamilton’s construction of C, and the only way to 
do so without commutativity is by his construction of H. This is an amazing, 
far-reaching result, and is a testament to Hamilton’s achievement. Not only 
did he find the only two ways to naturally extend the real numbers in the 
manner of the complex numbers, but by removing the ‘“‘natural’’ axiom of 
commutativity, encouraged the study of abstract, algebraic objects. 


Exercises 7.3 


1. (a) Verify that addition in C is associative and commutative. 
(b) Verify that multiplication in C is associative and commutative. 
(c) Verify the distributive laws in C. 
(d) Prove that every nonzero element in C has a multiplicative inverse. 
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2. Let z = —4 + 2i and w = 6 — 5i. Compute the following and express 
your answer in the forma + bi. 
(a) 2z + Sw (b) zw (c) 
-1 -1 
(e) z (f) w (2) 
. Prove Lemma 7.3.4. 
. Let z © C. Prove that z = Z if and onlyif z ER. 
Let 21, Z) © C. Prove that |z; + z,| S |z,| + |zo| and |z:z.| = |zillzol. 


. Prove that if z © C and k is any positive integer, then z* = z’. 


2 (d) w3 
(h) (@ + w)’ 


NX 


=(N 


NHWn sk W 


. Express the following complex numbers in polar form: 
(a) —3 + 3i (b) 6 — 2V3i (c) —5 (d) —2i 
(e) -3 - 3 (f) 6 + 2V3i (g) 5+i (h) 6 — 2i 
8. Let 0 # z = r(cos 0+ isin 6), w = s(cos @ + i sin ¢). 


(a) Prove that z7) = * (cos(—6) + i sin(—8)). 


(b) Let n € Z*. Prove that z" = + (cos(—n6) + i sin(—n6)). 


(c) Prove that : = = (cos (@ — 6) + isin(d — 8)). 


9. Let z = 3V3 — 9i and w = —2 + 2i. Compute: 
(a)rz* (b) 2° (c) w® (d) w 
(e) z° (f) 7128 (g) zip l2 (h) w ~100 
10. Let z and w be as in Exercise 9. 
(a) Compute the 4th roots of z. 
(b) Compute the 3rd roots of w. 

11. Prove Theorem 7.3.10. 

12. Find the 5th, 6th, and 8th roots of unity. 

13. Letn € Z*. 

(a) Prove that the sum of the nth roots of unity is zero. 
Hint: Factor the polynomial x" — 1. 
(b) Determine the product of the nth roots of unity. 

14. Suppose that u € C is a primitive nth root of unity and w € Cis an nth 
root of z, 0 # z € C. Prove that the nth roots of z are w, wu, wu’,..., 
wu! 

15. Prove Corollary 7.3.11. 


Discussion and Discovery Exercises 


D1. The complex number w is a primitive nth root of unity if every other 
nth root of unity is a power of w. 
(a) Which of the third roots of unity are primitive? Which of the fourth? 
Fifth? Sixth? Seventh? 
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(b) Based on your answers to part (a), speculate on the general case: 
when is an nth root of unity primitive? Can you prove that your 
answer is correct? 


D2. The exponential function e* is defined on R, but can be extended to a 
function on C by taking e? = e**” = e*(cos y + isin y). (There are, of 
course, many other ways to extend to C.) 

(a) Verify that this particular extension satisfies the rules of exponentia- 


tion: 
ee” = eww 
alt 
e 


(b) Give a simple expression for e’®, for 6 € R. (This is known as 
Euler’s Formula.) 

(c) Show that if z # 0, then z = re’, where r = |z| and 6 = arg z. 

(d) In part (a), you were not asked to verify the expected rule (e?)” = 
e*”. Why not? 


POLYNOMIALS 
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The Algebra of Polynomials 


In this chapter we begin a systematic study of polynomials. Our main goal 
is to develop some techniques for solving equations of the form f(x) = 0 
where f(x) is a polynomial. The coefficients of such polynomials will usually 
be real numbers. Sometimes we will restrict our attention to polynomials with 
just rational coefficients or even integer coefficients. We will also consider 
polynomials whose coefficients are complex numbers. 

In fact, we will first look at the set of polynomials whose coefficients are 
in some unspecified field. The definition of a field was given in Section 7.1. 


Definition 8.1.1 Let F be a field. A polynomial over F in the variable 
x is an expression of the form 


f(x) = ay + ax + ax? +... + ax" 


where a, @),@),....,4@,€& FandnE€ Z,n=0. 

The elements ap, a;, @,..., a, are called the coefficients of f(x). If 
a, ~ 0, so that n is the highest power of x that appears, then a, is called 
the leading coefficient of f(x); ap is called the constant term of f(x); and 
n is called the degree of f(x), denoted deg f(x). 

If f(x) = ay) # 0, then f(x) is called a nonzero constant polynomial 
and has degree 0. If f(x) = 0, f(x) is called the zero polynomial. The 
zero polynomial is not assigned a degree. 

We let F[x] denote the set of all polynomials with coefficients in F. 
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What makes F [x] interesting to study as a set is that it has many properties 
similar to the set Z of integers. First F [x] has two binary operations; namely, 
ordinary addition and multiplication of polynomials. 

The formal definitions of addition and multiplication in F[x] are as 
follows. Let f(x) = a) + ax + ax? +... + a,x" and g(x) = by + bx + 
box? +... + b,x” be in F[x]. If n * m, say n > m, then we can also write 
g(x) = by + byx + box? +... + b,x" where b; = 0 for all i > m. Notice that 
the degree of g(x) is still m, but writing g(x) this way makes the notation 
needed to define addition in F [x] easier. 

We define 


f(x) + g(x) = (a + bo) + (a + by)x + (a) + dy)x? + 22. + (ay + By) x" 


and 
f(x)g(x) = aobo + (aob; + aibo)x + (aoby + ayb, + agby)x? +... + anbnxX"*™. 


These binary operations are really just defined in the usual way we 
define addition and multiplication of real-valued polynomial functions. The 
coefficient c; of x‘ in the product f(x)g(x) can be expressed by the follow- 
ing formula: 


Ci = Agb; a a,bj-1 + anbj-> Pesce a;bo 


where it is understood that a, = 0 if i > n and b; = Oif j > m. 
It is easy to see that addition and multiplication in F[x] are associative 
and commutative. For example, if f(x), g(x), h(x) € F[x], where 


f(x) = ay + x + x? +... 4+ a,x", 
g(x) = by + Dix + byx? +...45,x", 


and 
h(x) = cy + CX + Cx? +... + yx", 
then 


(f(x) + g(x)) + A(x) = ((ao + bo) + (ay + Bi) x + (az t+ by) x? +... + (Gn + by) x") 

+ (Cot OX + Ox? +...4+06,X") 

= ((ao am bo) + Co) + ((a, + by) + C,)x 
+ ((ay + by) +07)? +... + (Gn + By) + en) x" 

= (a + (bo + co)) + (a Ae (b, + C,))x 
+ (a, + (by +.€))x? +... + (ay + (By + Cn) x" 

= Ay tax t+ anx? +... + anx" + (bo + co) + (bi +c) x 
+...4(b, + C,)x" 


= f(x) + (g(x) + A(x). 


Notice that we needed the fact that addition in F is associative. 

Similar proofs show that addition is commutative and multiplication is 
associative and commutative. (See the exercises.) 

The zero polynomial is the additive identity of F[x], and the addi- 
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tive inverse of ay + ax + ax’ +... + a,x" is —(a) + (-a,)x + 
(-a))x? +... + (-a,)x". 

Since F has an identity 1, the constant polynomial f(x) = 1 is the multipli- 
cative identity of F [x]. We leave as an exercise the fact that distributive laws 
hold in F [x]. 


If f(x) = 5x3 + 7x? — 9x + 2 and g(x) = 4x5 — x3 + 8 in R[x], then 
f(x) + g(x) = 4x9 + 4x3 + 7x? — 9x + 10 
and 
f(x)g(x) = 20x® + 28x’ — 41x® + x° + Oxt + 38x32 + 56x*° — 72x + 16. 


In the next example, we consider polynomials whose coefficients are in 
the field Z; of integers mod 5. (See Section 5.5 for the definition of Z,,.) 


If f(x) = [3]x? + [2]x + [4] and g(x) = [1]x? + [4]x? + [1] in Z;[2], 
then 

f(x) + g(x) = [1]}x? + [2]}x? + [2]x 
and 


f(x)g(x) = [B]x° + [4]xt + [2]x3 + [4]x? + [2]x + [4]. 


The following theorem gives us information about the degree of the sum 
and product of two polynomials. 


Let F be a field and let f(x), g(x) € F[x], where f(x) ¥ 0, g(x) ¥ 0, and 
f(x) + g(x) # 0. Then 


1. deg(f(x) + g(x)) S max{deg f(x), deg g(x)}. Moreover, if deg f(x) # 
deg g(x), then equality holds. 
2. deg(f(x)g(x)) = deg f(x) + deg g(x). 


PROOF: Let f(x) = do + ax + mx? +... + a,x" and g(x) = by + Dix + 
box? +... + 5,,x, where a, ~ 0 and b,, # 0. 


1. If n > m, then by the definition of addition in F [x], x" is the highest 
power of x that appears in f(x) + g(x), so deg(f(x) + g(x)) =n. Similarly, 
if m > n, then deg(f(x) + g(x)) = m. If n = m, then the highest power of 
x that appears in f(x) + g(x) is x” unless a, = —b,, in which case the highest 
power of x that appears is less than n. 

2. We leave this as an exercise. “> 


As pointed out previously, many properties of the set F[x] are similar 
to properties of the set of integers Z. Recall that the only elements of Z 
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that have multiplicative inverses are 1 and —1. Similarly, the only elements 
of F [x] that have multiplicative inverses are the nonzero constant polynomi- 
als. This fact is a consequence of Theorem 8.1.2 and is left as an exercise. 
In particular, it follows that F [x] is not a field. 


The Division Algorithm 


Another point of similarity is that there is a Division Algorithm for Polynomi- 
als similar to the Division Algorithm for Integers, as stated in Theorem 5.3.1. 
Also, the notion of an irreducible polynomial, to be defined in Section 82, 
mirrors the concept of a prime number. We will first prove the Division 
Algorithm for Polynomials and then derive some consequences having to 
do with finding zeros of polynomials. 

Throughout the rest of this chapter, F will denote a field. 


Division Algorithm for Polynomials Let f(x), g(x) € F[x], g(x) # 0. 
Then there exist unique polynomials g(x) and r(x) in F[x] such that f(x) = 
g(x)q(x) + r(x), where r(x) = 0 or 0 S deg r(x) < deg g(x). 


PROOF: We first prove the existence of q(x) and r(x). Let S = {h(x) € 
F[x] | h(x) = f(x) — g(x)q(x), for some g(x) € F[x]}; that is, S is the set 
of all polynomials in F [x] that can be expressed as f(x) minus a multiple 
of g(x). 

Clearly, S # ©. If the zero polynomial is in S, then f(x) = g(x)q(x) 
for some q(x) € F[x]. The theorem then follows with r(x) = 0. 

Suppose the zero polynomial is not in S$. Then let r(x) be a polynomial 
in § having the smallest degree of all polynomials in S. Such a polynomial 
exists by the Well-Ordering Principle. We have f(x) = g(x)q(x) + r(x) for 
some q(x) € F[x]. So it remains to prove that deg r(x) < deg g(x). 

Suppose that deg r(x) = deg g(x). Let 


@(x) = by + Bix + Dox? +... + Dx” 
where b,, # 0, and 

r(x) = cy + ex + Ex? +... 4+ 6x! 
where c, ~ 0. Let 

r(x) = r(x) — cbalx"g(x). 
(Note that ¢ = m.) Then 

r(x) = fx) — g)(qQ) + Ciba x"), 


so r(x) € S. But deg r,(x) < t = deg r(x), a contradiction to the fact that 
r(x) has minimal degree in S. Therefore we must have deg r(x) < deg g(x). 
This completes the proof that g(x) and r(x) exist. We leave the proof of 
uniqueness as an exercise. *» 
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If f(x) = ao + ax + ax? +... + a,x" © F[x] andc € F, then we write 
f(c) to denote the element ap + ajc + ajc? +... + a,c" © F. We say that 
we are substituting c for x in f(x) or evaluating f(x) at c. Of course, f(c) is 
in F, not F[x]. 

If f(x) = g(x) + A(x) where g(x) and A(x) are in F[x], then by the 
definition of addition in F[x], it follows that f(c) = g(c) + h(c). Similarly, 
if f(x) = g(x)h(x), then the definition of multiplication in F[x] implies that 


f(c) = g(c)h(c). 


Let f(x) € F[x] and c € F. Then there exists g(x) in F[x] such that f(x) = 
(x — c)g(x) + fle). 


PROOF: Applying the Division Algorithm for Polynomials with g(x) = x — 
c, there exist g(x) and r(x) in F[x] such that f(x) = (x — c)q(x) + r(x) where 
r(x) = 0 or deg r(x) < deg(x — c) = 1. Hence r(x) must be a constant 
polynomial, say r(x) = d. Substituting c for x we get d = r(c) = f(c). The 
corollary then follows. ©» 


Let f(x) = x7 + 7x? — 2x + 1 © R[x]. Then if f(x) is divided by x + 2 the 
remainder is f(—2) = 25. 


Zeros of Polynomials 


Definition 8.1.5 Let f(x) € F[x] and c € F. We say that c is a zero of 
f(x) if f(c) = 0. We also say that c is a root or a solution of the equation 


f(x) = 0. 


The following result, known as the Factor Theorem, is an important 
result relating zeros of polynomials to factoring a polynomial. 


Factor Theorem Let f(x) € F[x] and c € F. Then c is a zero of f(x) if and 
only if there is a q(x) in F[x] such that f(x) = (x — c)q(x). 


PROOF: The proof is immediate from Corollary 8.1.4. © 


Suppose we wish to find the zeros of the polynomial f(x) = x7 -— 2x +1¢€ 
R[x]. A quick inspection yields the fact that 1 is a zero of f(x) and thus 
x — lisa factor of f(x). Dividing x — 1 into f(x) we get f(x) = (x — 1)(x? 
+ x — 1). Next, using the quadratic formula, we see that the zeros of x? + 
x — 1 are (—1 + V5)/2. But any zero of a factor of f(x) is also a zero of 
f(x) itself. Thus 1, (—1 + V5)/2, and (—1 — V5)/2 are zeros of f(x). 
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Let f(x) = x? + 1 © R[x]. Clearly f(x) has no zeros in R. But since R C C, 
we can consider f(x) € C[x]. As we have seen, f(x) has two zeros in C, i 
and —i, and f(x) factors into (x — i)(x + i). 


These examples leave some unanswered questions. First, is it possible 
that we have not found all of the zeros of these polynomials? For example, 
could these polynomials be factored in another way? The answer to both of 
these questions is no. That we have found all of the zeros of these polynomials 
follows from the next corollary: a polynomial of degree n has at most n 
zeros. The fact that factorization of polynomials is essentially unique will be 
proved in the next section. 

Another question involves how we found the zero x = 1 in Example 4. 
In that case a little bit of trial and error calculation yielded the answer fairly 
quickly. But what if the coefficients or the degree of the polynomial had 
been larger? Simple trial and error computation would have been impractical, 
and besides, there may be no integer zeros at all. We will explore some of 
these questions in Section 8.3. 


Let f(x) € F[x], deg f(x) = n. Then f(x) has at most n zeros in F. 


PROOF: The proof is by induction on n. Let P(n) be the statement “Any 
polynomial f(x) in F [x] of degree n has at most n zeros in F.” If n = 0, then 
f(x) is a nonzero constant function and thus has no zeros in F. Therefore, 
P(O) is true. If m = 1, then f(x) = a,x + ay, where a, and ay are in F and 
a, # 0. It is easy to see that —aj'a,is the unique zero of f(x) and so P(1) 
is true. 

Now suppose P(n) is true. Then any polynomial in F [x] of degree n has 
at most n zeros in F. We now wish to prove that P(m + 1) is true; that is, 
we want to show that any polynomial in F [x] of degree n + 1 has at most 
n + 1 zeros in F. 

Let f(x) have degree n + 1 in F[x]. If f(x) has no zeros in F, there is 
nothing to prove. 

Suppose then that f(x) has at least one zero c in F. Then by Corollary 
8.1.6, f(x) = (x — c)g(x) for some g(x) in F [x]. Since deg f(x) =n + 1 and 
deg(x — c) = 1, deg g(x) = n. Therefore, by the induction hypothesis, g(x) 
has at most n zeros in F. If dis a zero of g(x), it is clear that dis a zero of 
f(x). Conversely, if d is a zero of f(x), d # c, then 0 = f(d) = (d — c)g(d) 
implies that d is a zero of g(x). 

Therefore, the zeros of f(x) are c together with any zeros of g(x). It now 
follows that f(x) has at most n + 1 zeros in F. The induction proof is now 
complete. @ 


Let p be an odd prime number and Z, the set of congruence classes mod p. 
Recall that, by Theorem 7.1.9, Z, is a field. Let [a] © Z,. Consider the 
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polynomial f(x) = x* — [a] € Z,[x]. By Corollary 8.1.7, f(x) has at most two 
zeros in Z,. If [b] is a zero of f(x), then it is clear that [—b] is also a zero. 
Therefore, if (a, p) = 1, f(x) has either no zeros or two zeros in Z,. Both 
cases can occur. For example, if p = 7, then x? — [3] € Z,[x] has no zeros 
in Z; and x — [2] has two zeros. 

We can express these results in the language of congruences. If [b] € Z, 
is a zero of x” — [a], then [b]}’ = [a] or b = a(mod p). In other words, x? — 
[a] has a zero in Z, if and only if there exists b in Z such that b’ = a(mod 
p). It follows then that for a given odd prime p and a given a in Z such that 
(a, p) = 1, the congruence x? = a(mod p) has either two incongruent solutions 
or no solutions. The two examples cited in the previous paragraph imply 
that there is no integer b such that b’ = 3(mod 7), and there exist 2 integers 
b, not congruent to each other mod 7, such that b? = 2(mod 7). In fact, 3 
and 4 are two such solutions. 


Let f(x) = x? — 2x + 1 © R[x]. In Example 4 we found three zeros of f(x). 
It now follows from Corollary 8.1.7 that these are the only zeros of f(x). 


“= HISTORICAL COMMENTS: SOLUTIONS OF POLYNOMIAL 
EQUATIONS BY RADICALS 


The quadratic formula has been known for centuries; even the Babylonians 
of 2000 B.c. solved quadratic equations. For a long time negative solutions 
were disregarded and full acceptance of complex numbers as solutions did 
not gain hold until the 19th century. 

The beauty of the quadratic formula is that it provides explicit solutions 
of a quadratic equation in terms of the coefficients of the quadratic polyno- 
mial. We say that the quadratic formula gives a solution by radicals for the 
equation ax? + bc + c = 0 because the solution involves just the algebraic 
operations of addition, subtraction, multiplication, division, and taking a 
square root; explicitly, the square root of b? — 4ac. 

A similar formula for solving a cubic equation ax? + bx? + cx + d= 0 
was discovered by Niccolo Tartaglia (1499?-1557) in the 16th century. Tar- 
taglia did not want his formula to be known. It was quite common in those 
days for mathematicians to keep their method of solving a problem a secret so 
that they could challenge other mathematicians to solve it. In 1539, however, 
Tartaglia revealed his method to the mathematician Girolamo Cardano 
(1501-1576) on the condition that Cardano would not divulge it to anyone. 
Cardano came to the conclusion that Tartaglia’s method of solving a cubic 
was the same as that of Scipione dal Ferro (1465-1526), a professor of 
mathematics at the University of Bologna. So Cardano published his own 
version of the method and gave credit to both Tartaglia and dal Ferro for 
having come up with the same solution. Needless to say, Tartaglia was not 
pleased and a dispute ensued as to who came up with the formula first, 
Tartaglia or dal Ferro. 
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The cubic formula is similar to the quadratic formula in the sense that 
it gives a formula for the roots in terms of the coefficients, using only algebraic 
operations of addition, subtraction, multiplication, division, and taking 
square roots and cube roots. However, the formula is much more compli- 
cated. 

Specifically, the equation x* + bx’? + cx + d = Ocan be solved as shown 
next. (We can always make the leading coefficient 1 by dividing through by it.) 

Let 


A= Bc? md 4b'd — 27d’ + 18bcd, 


ae c- Pa+SV-3A. 
= =p +3 be - Pa -SV—38, 
2 2 
Letw = —!+ V3. the cubic third root of unity. Then the zeros, a,, a, 


a; of the cubic equation x* + bx* + cx + d = 0 are given by 
a =5[-b Vy y+ vo], 
= = b+ar°Vy + wV6], 
a= 3 ee + oVy + wV6). 


Since there are three cube roots of a complex number (see Section 7.3), we 
have to specify which cube roots of y and 6 are chosen. In fact we can choose 
any cube root of y and thenthe cube root of 6 is chosen so that the equation 
WyW6 = b? — 3c is satisfied. 

For example, if we solve the equation x* — 3x + 2 = 0, we get: A = 0, 
y = —27, 6 = —27, thus giving the solutions a, = —2, a, = a; = 1. 

These solutions of course could have been obtained by noting that 1 is 
a solution, then factoring x — 1 and applying the quadratic formula to the 
remaining factor. In fact, if thereis an easily obtained integer solution, then 
this latter method is preferable to applying the cubic formula. 

Consider the equation x? + 2x? — x — 2 = 0. Using the cubic formula 
just given, we get the solution a, = {2 + VW10 + 9V3i + V10 — 9V3i}- 
Now it can be shown that this expression for a is in fact equal to 1! The 
other two solutions are —1 and —2. Clearly, the cubic formula is not always 
the most efficient method for solving a cubic equation. 

Shortly after the cubic equation was solved, a solution by radicals of the 
quartic equation ax* + bx* + cx? + dx + e = 0 was discovered by Lodovico 
Ferrari (1522-1565), a pupil of Cardano. This formula is even more compli- 
cated than the cubic and will not be discussed here. 

The next step of course was to find a solution by radicals for the quintic 
equation ax> + bx* + cx? + dx* + ex + f = 0. But no such formula was 
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found despite repeated efforts. By the 19th century, mathematicians such as 
Lagrange and Gauss became convinced that no such formula existed for the 
quintic equation and indeed for any polynomial equation of degree greater 
than four. Their intuition proved to be correct and a proof that the general 
polynomial equation of degree greater than four was not solvable by radicals 
was given by Niels Henrik Abel (1802-1829) in the early 19th century. 


Exercises 8.| 


1. Let F be a field. 
(a) Prove that addition in F[x] is commutative. 
(b) Prove that multiplication in F [x] is commutative. 
(c) Prove that multiplication in F [x] is associative. 
(d) Prove that the distributive laws hold in F [x]. 


2. Theorem 8.1.2 says that if deg f(x) # deg g(x), then deg(f(x) + g(x)) = 
max{deg f(x), deg g(x)}. Write the converse of this statement. Prove 
or disprove. 


3. Let F be a field. 
(a) Let f(x), g(x) © F[x]. Prove that if f(x)g(x) = 0, then f(x) = 0 or 
g(x) = 0. 
(b) Let f(x), g(x), h(x) © F[x]. Suppose that f(x)g(x) = f(x)h(x) and 
that f(x) # 0. Prove that g(x) = h(x). 
(Note: This exercise shows that a cancellation law holds in F[x].) 


4. Prove part 2 of Theorem 8.1.2. 


5. Let F be a field. Let f(x) € F[x]. Prove that f(x) has a multiplicative 
inverse in F [x] if and only if f(x) is a nonzero constant polynomial; that 
is, there exists c € F, c ¥ 0, such that f(x) = c. 


6. Let f(x) = 8x3 — 6x? + Sand g(x) = x* + x3 — 10x + 2in R[x]. Compute: 
(a) f(x) + g(x) (b) fxg) = © fay 
7. Let f(x) = [4]x? — [6]x? + [2]x + [3] and g(x) = [2]x? + [4]x — [5] in Z,[x]. 
(a) Compute: 
(i) fx) + g(x) Gai) fx)g(x) (ali). f(®)? 
(b) Prove that the additive inverse of f(x) is the polynomial [3]x? — 
[1]x? + [S]x + [4]. 


8 Prove that g(x) and r(x) of the Division Algorithm for Polynomials 
are unique. 


9. For f(x) and g(x) given next, find g(x) and r(x) that satisfy the conditions 
of the Division Algorithm: 
(a) f(x) = x9 + 6° — x? + 14x + 1, g(x) = x? + 5 in Q[y]. 
(b) f(x) = [3]x* + [4], g@) = [3]x? + [4]x — [1] in Zs[x]- 
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10. 


11. 


12: 
13. 


14. 


15. 


16. 
17. 
18. 


19. 


20. 


21. 
22. 


Use Corollary 8.1.4 to find the remainder if x5 — 7x? + 2x — 6is divided 
by x + 4. 


Find all the real zeros of the following polynomials: 

(a) f(x) = x3 + 2x7 +2x +1 

(b) f(x) = x4 + 2x3 + 3x? + 4x +2 

(c) f(x) =x5 + 2x4 +x+2 

Find all the complex zeros of the polynomials in Exercise 11. 

Let f(x) = [2]x? + [4]x? — [1]x + [2] € Z|]. Find all the zeros of f(x) 
in Z. 

Solve the following congruences, if possible: 

(a) x? = 5(mod 11) 

(b) x? = 8(mod 13) 

For each of the following primes p, find all integers a between 1 and 


p — 1 for which the congruence x? = a(mod p) is solvable, and then 
find those a for which it is not solvable. 


ay p= 7 
(b) p= 11 
(c) p = 13 


Write the negation of Corollary 8.1.7. 
Prove that the equation y’ = 7x + 3 has no integer solutions. 


If f(x) = ay + ax + x? t+... + a,x" © F[x], we define its derivative 
Df(x) by the formula Df(x) = a, + 2a,x + 3a3x? +... + na,x"", if 
n = 1 and Df(x) = 0 if n = 0 or if f(x) is the zero polynomial, based 
on the usual formula for the derivative of a real-valued polynomial. 
(But note that the coefficients are in an arbitrary field F and not necessar- 
ily the real numbers, so we make no mention of limits.) 
(a) Prove that if f(x) has degree n = 1, then deg Df(x) Sn — 1. 
(b) Prove that if f(x) € R[x] and deg f(x) = n = 1, then deg Df(x) 
n- 1. 
(c) Give an example of the field F and a polynomial f(x) € F[x] of 
degree n for some n = 1 such that deg Df(x) <n — 1. 
(d) Let f(x), g(x) € F[x]. Prove that 
(i) Df) + g(x) = Dflx) + De(x) 
(ii) D(f(x)g(x)) = Dflx)g(x) + flx)Dg(x) 
Let a be a nonzero real number and let f(x) = x? — a. 
(a) Prove that f(x) has only one real zero, namely, a = Wa. 
(b) Find the other two (complex) zeros of f(x). 


Use the cubic formula to find the zeros of the polynomial x? + 3x + 
2=0. 


Verify that {-2 + V10 + 9V/3i + V10 — 9V3i} = 1. 


Find all of the zeros of f(x) = x4 + 3x? — 4. 


8.2 
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Discussion and Discovery Exercises 


For the first two problems, review the techniques you learned in first-year 
calculus for graphing polynomials, in particular quadratic and cubic polyno- 
mials. 


D1. Let f(x) = ax? + bx +c GE R[x], a 4 0. 
(a) Give an example of a quadratic polynomial with no real zeros, then 
one with exactly one real zero, then one with exactly two real zeros. 
(b) Prove that if b? < 4ac, then f(x) has no real zeros. 
(c) Prove that if b? = 4ac, then f(x) has exactly one real zero. 
(d) Prove that if b? > 4ac, then f(x) has exactly two real zeros. 


D2. Let f(x) = ax? + bx? + cx + dE R[x], a 4 0. 

(a) Prove that f(x) has one, two, or three real zeros. 

(b) Give an example of a cubic polynomial with exactly one real zero, 
then one with exactly two real zeros, then one with exactly three 
real zeros. 

(c) Prove that if b’ = 3ac, then f(x) has exactly one real zero. 

(d) Suppose that b? > 3ac. Show that no conclusion beyond the one 
stated in part (a) can be drawn about the number of real zeros that 
f(x) may have. 


D3. Can you deduce any general conjectures from your answers to Exercise 
15? Test your conjecture using some other primes. 


D4. Generalize Exercise 19 to the polynomial f(x) = x" — a. 


UNIQUE FACTORIZATION 


Irreducible Polynomials 


In this section we consider the question of how polynomials factor. In the 
previous section, we saw that a zero of a polynomial yields a factor of that 
polynomial. But not all polynomials have a zero in a given field. This does 
not mean, however, that such a polynomial does not factor. For example, 
the polynomial x4 + x3 + 2x* + x + 1 factors into (x? + 1)(x* + x + 1). 
Neither of these polynomials, x? + 1 or x? + x + 1, have zeros in R and 
therefore they cannot be factored any more in R[x]. (They can, however, 
be factored in C[x].) 

Polynomials that cannot be factored are called irreducible. A formal 
definition is given next. Irreducible polynomials have properties similar to 
the prime numbers. Just as the Unique Factorization Theorem for the Inte- 
gers says that every integer is the unique product of prime numbers, we 
show in this section that every polynomial in F[x] can be written as a product 
of irreducible polynomials in an essentially unique way. 
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It would be wrong to take unique factorization for granted. Not every 
set has this property. For example, consider the set § = {a + bV-5| a, 
b € Z}. Like the integers, S has two binary operations of addition and multi- 
plication. In S we can write 9 = (3)(3) and 9 = (2 + V—5)(2 — V-5). 
It is not obvious, but in fact none of the elements 3, 2 + V-5, or 
2— V-—5 can be factored in S$. So 9 can be factored in two different 
ways in §. Thus S does not have the unique factorization property. 


Definition 8.2.1 Let F be a field and let f(x) © F[x] such that deg 
f(x) = 1. Then f(x) is said to be irreducible over F if f(x) cannot be 
written as a product of two polynomials in F[x] having smaller degree 
than f(x). 


The condition that the factors have smaller degree than f(x) is to avoid 
trivial factorizations obtained by factoring out a constant. For example, 
x? + x + 1 is irreducible over R, but we could write x? + x + 1 = 3(2x? + 
2x + 2). We do not consider this a proper factorization. 

It is not sufficient just to say that a polynomial is irreducible. It is necessary 
to say that it is irreducible over a particular field. For example, x? — 2 is 
irreducible over Q but not over R and x’ + 1 is irreducible over Q and over 
R but not over C. 

In general, it is a difficult problem to determine whether a polynomial 
is irreducible or not. Any polynomial of degree 1 is clearly irreducible over 
any field. For polynomials of degree 2 or 3, we have the following result. 


Let f(x) € F[x] with deg f(x) = 2 or 3. Then f(x) is irreducible over F if and 
only if f(x) has no zeros in F. 


PROOF: We leave it as an exercise. == 


The lemma is not true in general for polynomials of degree 4 or greater. 
The polynomial x* + 2x? + 1 = (x? + 1) is an example of a polynomial 
with no zeros in R but not irreducible over R either. The implication the 
other way is always true for polynomials of degree =2, however; that is, if 
f(x) is an irreducible polynomial in F[x] and deg f(x) = 2, then f(x) has no 
zeros in F. The proof is left as an exercise. 

In order to prove the Unique Factorization Theorem for Polynomials, 
we need to introduce the concepts of divisibility and greatest common divisor 
of polynomials. The definitions and properties are similar to the analogous 
ones for integers that we discussed in Chapter 5. 


Definition 8.2.3 A polynomial f(x) is called a monic polynomial if its 
leading coefficient is 1. 


Lemma 8.2.6 
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Definition 8.2.4 Let f(x), g(x) © F[x]. We say that f(x) is an associate 
of g(x) if there is an element c in F, c ¥ 0, such that f(x) = cg(x). 


If f(x) is an associate of g(x), then it is easy to see that g(x) is an associate 
of f(x) since g(x) = c'f(x). Consequently, we can simply say that f(x) and 
g(x) are associate to each other or simply associate. 

It is easy to see that every polynomial in F[x] is associate to a monic 
polynomial and that if f(x) € F[x] is irreducible over F, then any associate 
of f(x) in F[x] is also irreducible over F. We leave the details as exercises. 


Definition 8.2.5 Let f(x), g(x) € F[x]. We say that f(x) divides g(x) if 
there exists h(x) € F[x] such that f(x)h(x) = g(x). We write f(x) | g(x). 
We also say f(x) is a divisor or a factor of g(x). 


Let p(x) € F[x] such that deg p(x) = 1. If p(x) is irreducible over F, then 
the only divisors of p(x) are the nonzero constant polynomials and the 
associates of p(x). 


PROOF: We leave it as an exercise. «* 


Note that this lemma is analogous to the result that the only divisors in 
Z of a prime number p are +1 and +p. Recall that +1 are the only elements 
of Z with multiplicative inverses and that the nonzero constant polynomials 
are the only elements of F[x] with multiplicative inverses. 


Greatest Common Divisors 


In the proof of unique factorization of integers, we used the fact that if a 
prime number divides a product, then it must divide one of the factors of 
that product. A similar result holds for irreducible polynomials. To prove 
it, we must introduce the notion of a greatest common divisor of two polyno- 
mials. 


Definition 8.2.7 Let f(x) and g(x) be nonzero polynomials in F[x]. Then 
a greatest common divisor of f(x) and g(x) is a monic polynomial d(x) 
such that 


1. d(x) divides both f(x) and g(x), 
2. if h(x) € F[x] divides both f(x) and g(x), then h(x) divides d(x). 
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Just as in the case with integers, we must prove that a greatest common 
divisor of two polynomials always exists. Note that the proof of the following 
theorem is similar to the existence theorem for greatest common divisors of 
integers (Theorem 5.3.5). 


Let f(x) and g(x) be nonzero polynomials in F[x]. Then a greatest common 
divisor of f(x) and g(x) exists and is unique. Moreover, if g(x) is the greatest 
common divisor of f(x) and g(x), then there exist polynomials s(x) and t(x) 
such that d(x) = f(x)s(x) + g(x)t(x). 


PROOF: Let S = {f(x)s(x) + g9(x)t(x) | s(x), (x) € F[x]}; S is the set of all 
polynomials in F[x] that can be written in the form f(x)s(x) + g(x)t(x) for 
some polynomials s(x) and ¢(x) in F[x]. Since f(x) = f(x)1 + g(x)0 and 
g(x) = f(x)0 + g(x)1, both f(x) and g(x) are in S and therefore S is a 
nonempty set. 

By the Well-Ordering Principle, there is a polynomial d)(x) in Sofsmallest 
degree. Any constant times d)(x) is still in § and has the same degree as 
d(x). Multiplying d(x) by the reciprocal of its leading coefficient gives a 
monic polynomial d(x) of smallest degree in S. We claim that d(x) satisfies 
the definition of a greatest common divisor of f(x) and g(x). 

Now since d(x) € S, there exist polynomials s(x) and t(x) in F[x] such 
that d(x) = f(x)s(x) + g(x)t(x). 

We will show that d(x) divides f(x). By the Division Algorithm, there 
are polynomials g(x) and r(x) such that f(x) = d(x)q(x) + r(x) where 
r(x) = 0 or deg r(x) < deg d(x). We then have 


r(x) = f(x) — a(x)q(x) 
= fx) — (fa)s(x) + g(x)t(x))a(x) 
= fe) — s(x)q(X)) — B(x)tx)q(x). 


This implies that r(x) isin S. It follows that r(x) = 0, since d(x) has minimal 
degree in S. Hence f(x) = d(x)q(x) and d(x) divides f(x). 

A similar proof shows that d(x) divides g(x). 

Let h(x) € F[x] such that h(x) divides both f(x) and g(x). It follows from 
Exercise 8 that h(x) divides f(x)s(x) + g(x)t(x) = d(x). Hence d(x) is a 
greatest common divisor of f(x) and g(x). 

The proof of uniqueness is left as an exercise. ©# 


We will write g.c.d. to abbreviate greatest common divisor. 

Now we come to the problem of actually computing the g.c.d. of two 
polynomials. Recall that in Section 5.3, we had an algorithm for computing 
the g.c.d. of two integers by repeated application of the Division Algorithm 
for Integers. A similar process works by polynomials. The key result that 
makes this process work is the fact that if f(x) = g(x)h(x) + r(x), then the 
g.c.d. of f(x) and g(x) is equal to the g.c.d. of g(x) and r(x). (See Exercise 13.) 


EXAMPLE | 
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Since our main focus in this section is the Unique Factorization Theorem 
for Polynomials, we content ourselves with one example of how this algorithm 
works rather than a statement of it in general terms. 


Let f(x) = x6 + x5 — 4x4 + 2x3 — 11x? + x — 6 and g(x) = x* — 4x3 - 
20x? — 4x — 21. Applying the Division Algorithm to f(x) and g(x), we get 


F(x) = g(x) (x? + 5x + 36) + 250(x3 + 3x? + x + 3). 


By the previous remarks, then, the g.c.d. of f(x) and g(x) is equal to the 
g.c.d. of g(x) and 250(x? + 3x? + x + 3) and this in turn is equal to the g.c.d. 
of g(x) and x? + 3x? + x + 3. 

Applying the Division Algorithm again, this time to g(x) and x? + 
3x? + x + 3, we obtain the equation g(x) = (x3 + 3x? + x + 3)(x — 7). 
From this equation we can conclude that the g.c.d. of g(x) and x3 + 3x? + 
x+3isx> + 3x2? + x4 3. 

Thus the g.c.d. of f(x) and g(x) is x3 + 3x? +x + 3. 

Another method for finding the g.c.d. of two polynomials is to factor 
them. In this example, we can factor f(x) and g(x) as follows: f(x) = (x — 
2)(x + 3)(x? + 1) and g(x) = (x + 3)(x — 7)(xX? + 1). 

Now it is easy to see that the g.c.d. of f(x) and g(x) is (x + 3) 
(x? +1) = x9 4+ 3x? + x4 3, 

Note that we did not show how the factorization of f(x) and g(x) was 
obtained. Factoring a polynomial, even one of relatively low degree, is in 
general a difficult problem. We discuss some methods for doing so later in 
this section and in the next section. 


Definition 8.2.9 Two polynomials f(x) and g(x) in F[x] are called rela- 
tively prime if the g.c.d. of f(x) and g(x) is 1. 


Let f(x), g(x) € F[x]. Then f(x) and g(x) are relatively prime if and only if 
there exist polynomials s(x) and t(x) such that f(x)s(x) + g(x)t(x) = 1. 


PROOF: If f(x) and g(x) are relatively prime, then the result follows immedi- 
ately from Theorem 8.2.8. Conversely, suppose that there exist polynomials 
s(x) and t(x) such that f(x)s(x) + g(x)t(x) = 1. Then 1 is an element of the 
set S defined in the proof of Theorem 8.2.8. But it was shown in the proof 
of Theorem 8.2.8 that the g.c.d. of f(x) and g(x) is the monic polynomial of 
smallest degree in S. Since 1 is monic and deg 1 = 0, no monic polynomial 
in S can have smaller degree than the polynomial 1. Hence 1 is the g.c.d. of 
f(x) and g(x). © 
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Let f(x), g(x), h(x) © F[x]. Suppose that f(x) divides g(x)h(x) and f(x) and 
g(x) are relatively prime. Then f(x) divides h(x). 


PROOF: The proof is similar to the proof of Theorem 5.3.9 and is left as an 
exercise. ©) 


Let p(x) be an irreducible polynomial in F [x]. If p(x) divides f(x)g(x) where 
f(x), g(x) € F[x], then p(x) divides f(x) or p(x) divides g(x). 

PROOF: The proof is similar to the proof of Proposition 5.4.4 and is left as 
an exercise. 


Let p(x) be an irreducible polynomial in F[x]. If p(x) divides fi(x) - 
fi(x) « ... + fine), where each f(x) © F[x], then p(x) divides f(x) for 
some i = 1,2,...,m. 


PROOF: We leave it as an exercise. ©» 


The Unique Factorization Theorem 


We now come to the main result of this section, the Unique Factorization 
Theorem for Polynomials. The reader should refer to the proof of Theorem 
5.4.7, the Unique Factorization Theorem for Integers, since the proofs of 
these two theorems are very similar. 


Unique Factorization Theorem for Polynomials Let f(x) € F[x], deg 
f(x) = 1. Then f(x) is irreducible or is a product of irreducible polynomials 
in F[x]. Moreover this product is unique in the following sense: if 


FX) = Pi) * prlX) +. PX) 


= q(x) : qr(x) ae u(x) 
where each p(x) and q,(x) is irreducible in F[x], then m = n, and after 
renumbering, if necessary, p;(x) is associate to q(x) for eachi = 1,2,...,m. 


PROOF: The proof that f(x) is a product of irreducible polynomials proceeds 
by induction on deg f(x). Moreover, we use the Second Principle of Induction 
(Theorem 5.2.3). If deg f(x) = 1, then f(x) is already irreducible and so there 
is nothing to prove. 

Suppose the theorem is true for all polynomials of degree less than n. 
Assume deg f(x) = n. If f(x) is irreducible, there is nothing to prove. If not, 
we can write f(x) = g(x)h(x) where deg g(x) < n and deg h(x) < n. By 
the induction hypothesis, both g(x) and h(x) are products of irreducible 
polynomials. Hence f(x) is a product of irreducibles. 

We now prove uniqueness. Suppose 


F(X) = pil) + po(x) +... + P(X) 
7? q(x) . qx) Tet Onl X) 


EXAMPLE 2 


EXAMPLE 3 


EXAMPLE 4 
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where p(x) and q,(x) are irreducible in F[x]. We can assume, without 
loss of generality, that m =< n. Since p,(x) divides f(x), p(x) divides 
q(x) + q(x): ... + q(x). Therefore by Corollary 8.2.13, p,(x) divides 
q(x) for some j = 1, 2, ... , n. Renumber the q,(x)s so that q;(x) = 
qi(x). It follows that p,(x) and q,(x) are associate. (See Exercise 19.) So 
qi(x) = c)p,(x) for some constant c,. 

We thus get the equation 


Pr(X) * pal) + - «+ P(X) = CPi) * G(X) +--+ Qa). 
By Exercise 3(b) of Section 8.1, we can cancel p;(x) to get 


P2(x) Tee * PX) = C1" r(x) Ser G(X), 
where c, € F. Proceeding in the same way and renumbering if necessary, 


after m steps, we get that p,(x) and q(x) are associate for i = 1,2,...,m. 
If m <n, we would get 


1 =C,* (ae Cin * Anvi(X) + i qrlX), 


where c, C2, ..., Cn © F, a contradiction. Therefore m = n and the proof 


oe 


is complete. «* 


Let f(x) = x* — 2x3 — x? + 4x — 2 € Q[y]. Since 1 is a zero of this polynomial, 
x — lisa factor of f(x). 

Dividing by x — 1, we get f(x) = (x — 1)(x* — x? — 2x + 2). We see 
that 1 is also a zero of x° — x* — 2x + 2. Dividing by x — 1 again, we get 
fix) = & — 1)? — 2). 

Now x’? — 2 has no zeros in Q (why not?), so it is irreducible over Q. 
Therefore, (x — 1)*(x’ — 2) is the factorization of f(x) into a product of 
irreducible elements of Q[x]. 

Notice that f(x) © R[x] since Q C R. In R[x], x’ — 2 factors into (x — 
V2)(x + WO): Therefore, f(x) = (x — 1)*(x — V2)(x + a is the factorization 
of f(x) into a product of irreducible elements of R[x]. This is also a factoriza- 
tion of f(x) into a product of irreducible elements of C[x]. 


Let f(x) = — 8x? + 8x — 6 € Q[x]. Since 3 is a zero of f(x), we can write 
ie Pie - ea — x + 1). Using the quadratic formula, we see that 
— x + 1 is irreducible over Q and over R. Therefore this is the correct 
cohen of f(x) into a product of mredable omens of Q[x] and R[x]. 
In C, x? — x + 1 has two zeros: 3 + Be i and 3 — i. So in C[x], the 
factorization pot x) into a product of irreducibles is fix) = = 2(x — 3)(x - 3 
- X38 i)(x —3t X3 i), 


In the previous examples, finding a zero of the polynomial was the key step 
in computing the factorization. However, a polynomial with no zeros may 
still have a factorization. For example, let f(x) = x* + 2. Clearly f(x) has no 
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zeros in R. So if f(x) does factor over R, it must factor into a product of 
two polynomials of degree 2. We write 


xi +2 = (x? + ax + b)(x*? + cx + a), 

where a, b, c, d are in R. Multiplying out these factors, we get 

xi +2=x'+ (a+ c)x3 + (ac + b + dx’? + (ad + be)x + bd. 
This leads to the equations 


a+c=0, 
ac+b+d=0, 
ad + bc = 0, 
bd =2. 


Thus, c = —a and d = 2/b. Substituting into ad + bc = 0 gives 2a/b = ab 
or 2a = ab’. You can easily check that a = Ois not a possibility. So dividing 
by a gives b? = 20r b = +V2. Hence d = 2/b = +V2. Substituting these 
values into ac + b + d = 0, we get & = +2V2. Since a@ = —2V2 is im- 
possible, we get the following solutions: 


a= V8; b= v2; c=— W8; d= v2. 
Therefore, 
xt+2= (x2 + W8x + V2)(x? — W8x + V2). 


So we have established that x* + 2 is irreducible over Q but not over R. 
The two quadratic factors are irreducible over R: if they were not, then 
x’ + 2 would have linear factors and therefore would have a real zero, 
which it does not. 

Over C, these two quadratic factors are not irreducible. To see this, we 
show that x* + 2 has 4 zeros in C. A complex number z is a zero of x* + 
2 if and only if z is a fourth root of —2. Using Theorem 7.3.9, we get 
the solutions: 


2=V2(55 +5), 
onva(-cye cy) 
s(t) 


So, in C[x], we have 


xi +2 = (x — z)(x — 2)(% — 23)(X — Za). 


Corollary 8.2.15 
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The Unique Factorization Theorem for Polynomials can be re-stated 
another way. Let f(x) = p,(x) - po(x) - ... + Dm(x), where each p;(x) is 
irreducible in F[x]. Let a; be the leading coefficient of p(x). Then p(x) = 
agq(x), where qx) is a monic irreducible polynomial in F[x] associate to 
p(x). Let a = a+ +... + a,. Then f(x) = a+ qi(x) « qo(x) +... + Am(X). 

If we group together those q,(x)s that are associate to each other and 
use the fact thatif two polynomials are associate then one is a constant times 
the others, then we can write f(x) in the form 


f(x) = c+ Py(x)™ + Py(xy™ +... + Px)", 


where c € F and each P(x) is a monic irreducible polynomial in F[x]. 
Note that each P;(x) equals some q;(x) and the exponent m; is the number 
of times that qg;(x) or one of its associates appears in the product 
a(x) + g(x) + ... + dn(x). Therefore, no P(x) is associate to P(x) if 
i # j. So we have the following. 


Let f(x) € F[x], deg f(x) = 1. Then there exist a unique element c in F, 


unique monic irreducible polynomials P,(x), P(x), ..., P(x) in F[x], and 
unique integers m,, m),..., m, Such that 

f(x) =c+ Pi(xy™ + Po(xym +... Px) 
Moreover, P(x) is not associate to P;(x) if i ¥ j, for all i, 7 = 1,2,...,7. 


We conclude this section with a generalization of Corollary 8.1.6. 


Definition 8.2.16 Let f(x) € F[x] and let c be a zero of f(x). We say c 
is a zero of f(x) of multiplicity m if (x — c)” is a factor of f(x) but (x — 
cy"*) is not. A zero of multiplicity 1 is called a simple zero; a zero of 
multiplicity greater than 1 is called a multiple zero. 


Let f(x) = (x + 3)(x — 7)3(x — 9). Then —3 is a zero of f(x) of multiplicity 
4, 7 is a zero of multiplicity 3, and 9 is a zero of multiplicity 1. 


Let f(x) € F[x] and let c, c), ... , c, be the distinct zeros of f(x) with 
multiplicities m,, m),...,m, respectively. Then 


F(x) = (& — 1) — 2)... — CB (X), 
where g(x) € F[x] and g(x) has no zeros in F. 


PROOF: By Corollary 8.2.15, we can write 
f(x) = c+ Py(xy" + Po(x) +... PAX)", 
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where each P(x) is a monic irreducible polynomial in F[x] and no two of 
them are associate. 

By the definition of multiplicity, (x —c;)" is a factor of f(x) and 
(x — ¢)'"*1 is not, for each i = 1, 2,..., t. Since x — c; is monic irreducible, 
by uniqueness of factorization (see Corollary 8.2.15), each x — c; equals some 
P,(x) of degree 1 and m; = nj. 

Conversely, if some P,(x) has degree 1, then P,(x) = x + a, for some a, 
in F. It follows that —a, is a zero of f(x) and so P,(x) = x — c; for some i. 
Again by unique factorization, we have n, = mj. 

Consequently, exactly t of the polynomials P;(x) have degree 1. Renumber 
the P(x)s so that the first t of them have degree 1 and so that P(x) = x — 


c and nj = m; for j = 1,2,...,¢. 

Let g(x) = c+ Pisi(x) + Pax) +... + P(x) ifr > tor g(x) =cifr=t 
Then f(x) = (x — c)™(x — c)™ ... (x — ¢,)"g(x). Since deg Pj(x) = 2 for 
j=tt+41,...,7, g(x) can have no linear factors and therefore g(x) has no 


zeros in F. «* 


Let f(x) © F[x] and let c,, ©, ... , c, be the distinct zeros of f(x) with 
multiplicities m,, m),..., m, respectively. Then m, + m, +... +m, = 
deg f(x). 


PROOF: We leave it as an exercise. ©? 


HISTORICAL COMMENTS: THE FUNDAMENTAL THEOREM 
OF ALGEBRA 


In the historical comments at the end of Section 8.1, we saw that much effort 
was spent trying to find extensions of the quadratic formula to higher degree 
polynomials, with the result that appropriate formulas for third- and fourth- 
degree polynomials were discovered in the 16th century and the impossibility 
of a formula for polynomials of degree higher than four was proved in the 
19th century. During this same stretch of time, the 16th to the 19th century, 
a related problem was being discussed: whether or not every real polynomial 
of degree n has n zeros (counting multiplicity) and where those zeros actually 
exist. It was known of course that not every real polynomial has zeros in R 
and that some have zeros in C. 

Albert Girard (1595-1632) was the first mathematician to assert that 
every real polynomial of degree n has n zeros. He did not claim that all of 
the zeros had to be in C but allowed for the possibility that there may be a 
bigger field than C in which all of the zeros exist. Euler took Girard’s assertion 
a step further by claiming that every real polynomial has all of its zeros in 
C. He was able to prove his assertion for all polynomials of degree <6. 

Attempts to prove the theorem for all polynomials were made by Euler, 
Jean le Rond D’Alembert (1717-1783), Joseph Louis Lagrange (1736-1813), 
and Pierre Simon Laplace (1749-1827). All of these attempts fell short, 
mainly because, in trying to prove that a real polynomial has n zeros, the 
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mathematicians assumed that such a polynomial has at least one zero in C. 
This flaw was pointed out by Carl Friedrich Gauss (1777-1855), one of the 
greatest mathematicians of all time. Gauss gave the first proof of what has 
come to be known as the Fundamental Theorem of Algebra: that every 
polynomial with real coefficients (or even complex coefficients) has at least 
one zero in C. From the Fundamental Theorem, it is then fairly straightfor- 
ward to show that a real polynomial of degree n has n zeros in C. We will 
discuss these matters in the next section. 

Gauss’s first proofof the Fundamental Theorem of Algebra was contained 
in his doctoral thesis in 1799. Even this proof contained some statements that 
Gauss did not prove rigorously and only years later, with the development of 
topology, were the gaps in the proof filled. However, Gauss subsequently 
gave three additional proofs of the Fundamental Theorem of Algebra and 
these proofs were completely rigorous. 


Exercises 8.2 


1. Prove that 4x* — 5x + 2 is irreducible over Q and R but not over C. 


2. Factor the following polynomials into irreducibles over Q, R, and C. 
(a) x7 -— 5 
(b) 2x3 — 11x? + 16x — 7 
(c) x° — 3x* + 2x3 + 7x? — 9x — 10 

3. Let f(x) be an irreducible polynomial in F[x]. Prove that if deg f(x) = 
2, then f(x) has no zeros in F. 


4. Prove Lemma 8.2.2. 
5. Prove that every polynomial in F[x] is associate to a monic polynomial. 


6. Let f(x) be an irreducible polynomial in F[x]. Prove that any associate 
of f(x) is irreducible in F[x]. 


7. Let f(x), g(x) © Fx]. Suppose that f(x) divides g(x). 
(a) Prove that any associate of f(x) divides g(x). 
(b) Prove that f(x) divides any associate of g(x). 
8 Let f(x), g(x), A(x) € F[x]. Suppose that f(x) divides g(x) and f(x) 
divides h(x). Prove that f(x) divides s(x)g(x) + t(x)h(x) for all s(x), t(x) 
in F[x]. 
9. Let f(x), g(x) © F[x]. 
(a) Suppose that f(x) divides g(x) and g(x) divides f(x). Prove that f(x) 
and g(x) are associate. 
(b) State the converse of part (a). Prove or disprove. 
(c) State the contrapositive of part (a). 


10. Write f(x) ~ g(x) if f(x) and g(x) are associate. Prove that this defines 
an equivalence relation on F[x]. 
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11. 


12. 
13. 


14. 


15. 
16. 


17. 
18. 
19. 


20. 
21. 
22. 
23. 


24. 


25. 


26. 


(a) Prove Lemma 8.2.6. 
(b) State the converse of Lemma 8.2.6. Prove or disprove. 


Prove the uniqueness part of Theorem 8.2.8. 

Prove that if f(x) = g(x)h(x) + r(x), then the g.c.d. of f(x) and g(x) is 

equal to the g.c.d. of g(x) and r(x). 

Find the g.c.d. of f(x) and g(x): 

(a) f(x) = 3x* — 3x? and g(x) = x? — 2x — 8 

(b) f(x) = x® — 5x° + 10x4 — 14x3 + 16x? — 8x and g(x) = x4 + x - 
4x? + 2x — 12 

Prove Corollary 8.2.11. 

Let f(x), g(x) © Flx). 

(a) Prove that if f(x) and g(x) have a common zero in F, then they are 
not relatively prime. 

(b) State the converse of part (a). Prove or disprove. 


Prove Corollary 8.2.12. 
Prove Corollary 8.2.13. 


Suppose that p(x) and q(x) are irreducible in F[x] and p(x) divides q(x). 
Prove that p(x) and q(x) are associate. 


Find all irreducible polynomials of degree 2 and degree 3 in Z,[x]. 
Find all irreducible polynomials of degree 2 and degree 3 in Z,[x]. 
Prove Corollary 8.2.18. 

Let = be the relation on F[x] — {o} defined by f(x) = g(x) if deg 


f(x) S deg g(x). Determine whether this relation is reflexive, symmetric, 
transitive, or antisymmetric. Give proofs for each of your assertions. 


Let f(x) € F[x] and let Df(x) denote its derivative as defined in Exercise 

18 of Section 8.1. 

(a) Let c € F be a zero of f(x). Prove that c is a simple zero of f(x) if 
and only if c is not a zero of Df(x). 

(b) Suppose that f(x) and Df(x) are relatively prime. Prove that all of 
the zeros of f(x) are simple. 

(c) State the converse of part (b). Prove that this converse is false in 
general but is true if F is the field C of complex numbers. 


Let a€ C,a #0 and let f(x) = x" — a. Prove that all the zeros of f(x) 
are simple. 


Let p be a prime number and let a € Z,. Let f(x) = x? — a € Z,[x]. 
Prove that all of the zeros of f(x) are multiple zeros. 


Discussion and Discovery Exercises 


D1. Compare and contrast the proofs of Theorem 5.3.5 and Theorem 8.2.8. 


Describe the similarities in the proofs as well as their differences. 


8.3. 


Theorem 8.3.1 


Theorem 8.3.2 
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D2. Repeat Exercise D1 for the proofs of Theorem 5.4.7 and Theorem 8.2.14. 


D3. Explain why, in the algorithm for computing the g.c.d. of two polynomi- 
als, the process will eventually terminate. 


POLYNOMIALS OverR C, R, AND Q 


The Fundamental Theorem of Algebra 


In this section we apply some of the results of this chapter to polynomials 
over R and C. First, using the Fundamental Theorem of Algebra, we see 
how polynomials over the complex numbers factor into irreducibles and then 
we do the same thing for polynomials over the real numbers. In particular, 
we will see that the only irreducible polynomials in C[x] are the polynomials 
of degree 1 and the only irreducible polynomials in R[x] are the polynomials 
of degree 1 and the polynomials of degree 2 with no real zeros. Then we 
consider polynomials in Q[x]. Unlike R[x] and C[x], there are irreducible 
polynomials in Q[x] of every degree. For example, one can show, although 
it is not obvious, that if p is a prime number, then the polynomial x” + p is 
irreducible over Q for every positive integer n. This is a consequence of the 
Eisenstein criterion, which we prove in this section. 

The key result needed for this section is the Fundamental Theorem of 
Algebra. As we saw in the historical comments at the end of Section 8.2, 
this is a very deep result and was first proven by Gauss. Since then, many 
different proofs have been devised, but they all require advanced techniques. 
You may see a proof in a later course (abstract algebra, complex analysis, 
or one of several other possibilities); here we will simply state this impor- 
tant theorem. 


Fundamental Theorem of Algebra Let f(x) € C[x], deg f(x) = 1. Then 
f(x) has at least one zero in C. 


Note that this theorem applies to polynomials with real coefficients since 
R CC. For example, the polynomial f(x) = 3x* + 4x° + 2 has no real zeros. 
(This can be seen easily by graphing.) But the Fundamental Theorem of 
Algebra tells us that f(x) has at least one zero in C. In fact, it will have four 
zeros in C if we count multiplicity. More generally, we have the following: 


Let f(x) € C[x], deg f(x) = n = 1. Then f(x) has exactly n zeros in C, 


counting multiplicity. Specifically, if c;, @, ... , c, are the distinct zeros 
of f(x) in C, with multiplicities m,, m),..., m, respectively, then f(x) = 
a(x — c)™(x — co)" ... (x — c)™ where a € C that is, f(x) factors into 


all linear factors in C[x] andm, +m+...+m,=n. 
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PROOF: By the Fundamental Theorem of Algebra, f(x) has at least one zero 
in C. By Theorem 8.2.17, we can write f(x) = (x — c)™(x — @)™...(4% — 
c,™ where g(x) has no zeros in C. 

Since, again by the Fundamental Theorem of Algebra, every polynomial 
of degree =1 in C[x] has at least one zero in C, we must have deg g(x) = 
0, so g(x) = a for some a in C. The theorem is now proved. « 


No polynomial in C[x] of degree at least 2 is irreducible. 

PROOF: We leave it as an exercise. “” 

Consider the polynomial f(x) = x” — 1 where n is a fixed positive integer. 
In Section 7.3 we learned that the zeros of f(x) are the nth roots of unity 


@p, @,, @2, ... , @,-1;, Where @, = cos 27k/n + isin 27k/n forO Sk Ss 
n — 1. So we can write x” — 1 = (x — ay)(x — @)... (% — @,-1). 


Real Polynomials 


We now consider polynomials over the field of real numbers R. We will 
show that the irreducible polynomials in R[x] are the polynomials of degree 
1 and the polynomials of degree 2 that have no real zeros. 


Definition 8.3.4 Let f(x) = ax’ + bx. + c¢ © R[x]. Then the real number 
b? — 4ac is called the discriminant of f(x). 


A polynomial of degree 2 in R[x] is irreducible over R if and only if its 
discriminant is less than 0. 


PROOF: This proof, which follows easily from Lemma 8.2.2 and the quadratic 
formula, is left as an exercise. @» 


Before we prove our main result about polynomials in R[x], recall that 
we proved in Theorem 7.3.5 that if z © C is a zero of a polynomial f(x) in 
R[x], then its complex conjugate Z is also a zero. It is also true that their 
multiplicities as zeros of f(x) are equal. 


Let f(x) € R[x], deg f(x) = 1. Let z be a complex nonreal zero of f(x). 
Then z and Z have the same multiplicities as zeros of f(x). 


PROOF: Let m and n be the multiplicities of z and z, respectively. Assume 
that m > n. Then f(x) = (x — z)"(x — Z)"g(x), where g(x) © R[x] and 
g(z) # Oand g(z) ~ 0. 

Let h(x) = (x — z)(x — Z). Then h(x) = x? — (z + Z)x + zz, which is a 
polynomial in R[x], since both z + Z and zz are real numbers. 

Then f(x) = A(x)"(x — z)""g(x). 


Theorem 8.3.7 


Corollary 8.3.8 


Lemma 8.3.9 


Lemma 8.3.10 
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The polynomial (x — z)""g(x) must have real coefficients since it is equal 
to f(x)/h(x)", which is a quotient of two polynomials with real coefficients. 
But (x — z)”""g(x) has z as a zero and not Z, contradicting Theorem 7.3.5. 
Therefore, the assumption that m > n must be false. A similar proof shows 
that n > m is false also. Thus m = n. &@ 


Let f(x) € R[x], deg f(x) = 1. Then f(x) can be factored in R[x] into a product 
of linear polynomials and quadratic polynomials of negative discriminant. 


PROOF: Let 7,, %, ... , 7; be the distinct real zeros of f(x), if any, with 
multiplicities m,, m),..., m, respectively. The complex, nonreal zeros of 
f(x) occur in conjugate pairs. Let 2), Z1, 22, 2,.-., %, Z, be the distinct 
complex, nonreal zeros of f(x), if any, with multiplicities m,, m.,..., m4, 
respectively. (Note: Here we use Lemma 8.3.6.) 

By Theorem 8.3.2, we can write: 


f(x) = ale — n(x — )™.. Ce — (x — 21)"(% — 2)" 
oe (XH = ZX — Zs)" 


where a € R is the leading coefficient of f(x). 

Let g(x) = (x — z)(x — Z) fori = 1,2,...,k. 

Then g(x) = x? — (z; + Z)x + 22), which is a polynomial in R[x], since 
both z; + z; and z;z; are real numbers. Also, g(x) has no real zeros so it must 
be irreducible and have negative discriminant by Lemma 8.3.5. « 


No polynomial in R[x] of degree at least 3 is irreducible. 


PROOF: We leave it as an exercise. “* 


Factoring polynomials of degree 3 or better over the reals is in general 
a difficult problem. Even finding zeros of such polynomials is difficult. The 
following result from first-year calculus, a special case of Rolle’s Theorem, 
is helpful. 


Let f(x) € R[x]. Between any two distinct real zeros of f(x) there is a point 
where its derivative Df(x) is zero. Therefore if f(x) has m distinct real zeros, 
then Df(x) has at least m — 1 real zeros. 


Another result of calculus, a consequence ofthe Intermediate Value Theo- 
rem, says the following: 


Let f(x) € R[x]. If f(r) > 0 and f(m) < 0 for some r, and 7, in R, then f(x) 
has at least one zero between 7, and rp. 


Let f(x) = x + 4x — 9. Then Df(x) = 3x? + 4, which is never 0. It follows 
from Lemma 8.3.9 that f(x) has at most one real zero. Since f(1) = —4 and 
f(2) = 7, then by Lemma 8.3.10, f(x) has a zero between 1 and 2. 
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Polynomials over Q 


In general, determining whether or not a polynomial is irreducible over the 
field of rational numbers Q is a difficult problem. One way to factor a 
polynomial is to find zeros, although, as we have seen, a polynomial with 
no zeros in a particular field may still be factorable over that field. In what 
immediately follows we will concern ourselves with ways of finding rational 
zeros of polynomials in Q[x]. Then we will consider some tests for irreduc- 
ibility. 

Let g(x) € Q[x]. If we multiply g(x) by the least common denominator 
of its coefficients, we get a polynomial with integer coefficients. More specifi- 
cally, there is an integer s such that f(x) = sg(x) is a polynomial with 
integer coefficients. Clearly, any zero of g(x) is a zero of f(x) and vice versa. 
Therefore, in developing criteria for finding zeros of polynomials in Q[x], it 
suffices to consider polynomials with integer coefficients. We will let Z[x] 
denote the set of such polynomials. Note that Z[x] is closed under addition 
and multiplication of polynomials. 


Let f(x) = a) + qx + ax? +... + a,x" © Z[x]. Let ; be a zero of f(x) in 


Q, where (r, s) = 1. Then r divides aj and s divides a,. 
PROOF: Substituting ; for x and multiplying through by s” gives the equations 


= -1 
Ov" + Apart is+...+ ars” + as" =0, 
as" = —r(ayr") + a,r'?s +... Fas"). 


This last equation implies that r divides ays" and since (r, s) = 1, we must 
have that r divides ay. (Why?) 
We can also write 


a,r" = —s(a,yr") +... + ars”? + ays"). 

Thus s divides a,r” and since (r, s) = 1, s divides a,. «* 
Let f(x) be a monic polynomial in Z[x]. Then any rational zero of f(x) must 
be an integer that divides the constant term of f(x). 


PROOF: We leave it as an exercise. @ > 


Let f(x) = 30x? — 17x’ — 3x + 2. By Theorem 8.3.11, if : is a rational zero 


of f(x), then r | 2 and s | 30. The possible values of r and s are: r = +1 or 


EXAMPLE 4 


EXAMPLE 5 


Theorem 8.3.13 


Theorem 8.3.14 
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+2 ands = +1, +2, +3, +5, +6, +10, +15, or +30. Plugging in values, we 
get that the zeros of f(x) are 3, — 3, and 2. We can write 


terraofe- ler 3)e3) 


= (2x — 1)(3x + 1)(5x — 2). 


Let f(x) = 12x3 — 3x + 2. Any rational zero : of f(x) must satisfy r = +1, 


+2 and s = +1, +2, +3, +4, +6, +12. After trying all possible values, we 
see that f(x) has no rational zeros. 

We can then ask how many real zeros f(x) has. Simply graphing f(x) 
using methods of first-year calculus reveals that f(x) has only one real zero. 
Therefore, f(x) has one irrational real zero and two nonreal complex zeros. 
We can also conclude that f(x) is irreducible over Q but not over R. 


We will use Corollary 8.3.12 to prove that V2 is irrational. (Recall that we 
proved this another way in Section 5.4.) Since V2 is a zero of the polynomial 
x? — 2 in Z[x], Corollary 8.3.12 tells us that if V2 is rational it must be an 
integer that divides 2. But clearly +1 and +2 are not zeros of x? — 2. 


We now give some criteria for a polynomial f(x) in Z[x] to be irreducible 
over Q. We first state without proof a result due to Gauss. 


Let f(x) € Z[x]. If f(x) is irreducible over Z, then f(x) is irreducible over Q. 


Theorem 8.3.13 says that if a polynomial with integer coefficients cannot 
be factored into two polynomials of smaller degree with integer coefficients, 
then it cannot be factored into two polynomials of smaller degree with 
rational coefficients. 


Eisenstein Irreducibility Criterion Let f(x) = a + ax + @x?o+... + 
a,x" © Z[x]. Suppose there exists a prime p such that p divides a; for all j = 
0,1,2,...,2— 1, p does not divide a,, and p* does not divide ay. Then f(x) 
is irreducible over Q. 


PROOF: The proof is by contradiction. Suppose f(x) factors into two polyno- 
mials g(x) and h(x) in Z[x] of smaller degree than f(x). 

Let g(x) = bo + Dix + by? +... + Dyax™ and h(x) = co + cx + 
ox? +... + ¢,x". Then ay = bocy, and since p divides a), p must divide by 
Or Cy. Also, since p* does not divide ay, p cannot divide both by and cy. For 
the sake of argument, we assume p divides cy but not bo. 

Now a, = b,,c,, So p does not divide b,, and p does not divide c,. Let k 
be the smallest integer such that p does not divide c,. ThenO<k=sr<n. 
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Consider the coefficient a, = boc, + bicy-1 +... + bycy. Since p divides 
the integers a,, Co, C},.-- 5 Cy-1, it follows that p divides boc,. But this is a 
contradiction since p does not divide by and p does not divide c,. 

Therefore f(x) must be irreducible over Z, and by the previous theorem 
it must also be irreducible over Q. =~ 


Let f(x) = 7x° — 9x4 + 6x? + 15. Applying the Eisenstein Criterion with 
p = 3, we get that f(x) is irreducible over Q. 


Let f(x) = x" + p, where n is some positive integer and p is a prime. By the 
Eisenstein Criterion, f(x) isirreducible over Q. Thus we see that Q[x] contains 
irreducible polynomials of arbitrarily high degree. 


The method of determining irreducibility that we consider next is called 
reducing mod p. 


Definition 8.3.15 Let f(x) = ay + ax + ax? +... + a,x" © Z[x], deg 
f(x) = 1. Let p be a prime number. Then the polynomial 


f(x) = [ao] + [ai]x + [a]x? +... + [a,]x" © Z,[x] 


is called the reduction of f(x) mod p. As usual, the notation [a] means 
the congruence class of a mod p. 


Let f(x), g(x), A(x) © Z[x]. 


1. deg f,(x) = deg f(x) and deg f,(x) = deg f(x) if and only if p does 
not divide a,. 
2. If f(x) = g(x)A(x), then f,(x) = g,(x)h,(x). 


PROOF: We leave it as an exercise. ** 


Let f(x) © Z[x], deg f(x) = 1. Let p be a prime and let f,(x) be the reduction 
of f(x) mod p. If f,(x) is irreducible over Z, and deg f,(x) = deg f(x), then 
f(x) is irreducible over Q. 


PROOF: It suffices to prove that f(x) is irreducible over Z. Suppose, on the 
contrary, that f(x) = g(x)h(x) where g(x) and h(x) are polynomials in Z[x] 
both of degree less than the degree of f(x). 

Then f,(x) = g,(x)h,(x) and 


deg g,(x) = deg g(x) < deg f(x) = deg f,(x), 
deg h,(x) = deg h(x) < deg f(x) = deg f,(x). 


This implies that f,(x) is not irreducible over Z,, a contradiction. ©» 


EXAMPLE 8 


EXAMPLE 9 
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Let f(x) = x* + 6x° — 5x + 3. We try reduction mod 2 and get f(x) = 
x4 + x + [1]. If f(x) has a linear factor, then it must have a zero in Z). But 
f(0}) = [1] and f([1]) = [1]. 

Thus if f,(x) factors, it must have two irreducible quadratic factors. But 
the only irreducible polynomial of degree 2 in Z,[x] is x? + x + [1]. (Check. 
See Exercise 20 of Section 8.2.) A calculation shows that f(x) # (x? + x + 
[1])(@? + x + [1]). 

Therefore f,(x) is irreducible over Z, and by the previous theorem f(x) 
is irreducible over Q. 


Let f(x) = x° + 17x + 36. One way to check irreducibility over Q is by 
determining whether f(x) has any zeros in Q. As we have seen, it suffices 
to look at the divisors of 36. Since there are many of these, we will try 
reduction mod p instead. 

f(x) = x3 + x is not irreducible over Z). 

fix) = x8 + [2]x is not irreducible over Z;. 

fsx) = x2 + [2]x + [1] has no zeros in Z; and therefore is irreducible 
over Zs. 

Thus f(x) is irreducible over Q. 


Note: To apply the reduction method, we just have to show that /,(x) is 
irreducible for some p that does not divide the leading coefficient of f(x). 
However, it is possible to have a polynomial f(x) in Z[x] that is irreducible 
over Q but for which /,(x) is not irreducible over Z, for any prime p. An 
example is x* + 1. The proof that this is not irreducible over Z, for any 
prime p is nontrivial. 


“2 HISTORICAL COMMENTS: GALOIS’S THEORY 


In Section 8.1 we noted that Abel proved that the general polynomial equa- 
tion of degree greater than 4 is not solvable by radicals. But the story does 
not end there. Some polynomial equations of degree greater than 4 can be 
solved by radicals. For example, Gauss was able to prove that the equation 
x" — 1 =0, the so-called binomial equation, is solvable by radicals. (We saw 
in Section 7.3 that the roots of this equation are the nth roots of unity.) 

What Gauss did not do was give a criterion for determining whether or 
not any given polynomial equation is solvable by radicals. This more difficult 
problem was solved by Evariste Galois (1811-1832). 

Galois was a young genius whose work on the theory of polynomial 
equations was either ignored or misunderstood by the leading mathemati- 
cians of his time. Two of his papers were submitted to Cauchy, who lost 
them, and another to Fourier, who died soon after, with the result that this 
paper was also lost. Another of Galois’s papers was sent to Poisson, who 
returned it because he thought it unintelligible. Unfortunately much of 
Galois’s time was spent in prison because of radical political activities, and 
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in May 1832 he was challenged to a duel. The night before the duel was to 
take place, Galois wrote up as best he could the results of his work. The 
next day he was killed in the duel. Years later, his work was recognized for 
its significance and was published in leading mathematical journals. An entire 
branch of abstract algebra developed from Galois’s work and today is known 
as Galois Theory. Its consequences are still being felt in modern mathemati- 
cal research. 


Exercises 8.3 


1. Prove Corollary 8.3.3. 


. Prove Lemma 8.3.5. 
. Prove that every polynomial in R[x] of odd degree has at least one 


real zero. 
(Hint: The Intermediate Value Theorem will help.) 


. Determine the number of real zeros of the following polynomials. Find 


the integer closest to each zero. 
(a) f(x) = 2x3 + 3x? — 12x + 4 
(b) f(x) = x — 9x? + 27x — 2 

(c) f(x) = x5 — 20x + 28 


. Let f(x) = xt — 2x3 + 4x? — 3x + 2. 


(a) Factor f(x) into irreducibles over R. 

(Hint: The irreducible factors have integer coefficients.) 
(b) Prove that f(x) has no real zeros. 
(c) Find all of the complex zeros of f(x). 


1 eas ‘ : QT es, QT 
. Let w be the primitive eighth root of unity cos — + i sin —. 


8 8 
(a) Find a fourth-degree polynomial with integer coefficients of which 
w is a Zero. 
(b) Find a second-degree polynomial with real coefficients of which w 
is a Zero. 
(Hint: Factor the polynomial x* — 1.) 


7. Prove Corollary 8.3.8. 
8. Let f(x) € R[x]. Lemma 8.3.9 says that if f(x) has m distinct real zeros, 


10. 
11. 


then its derivative Df(x) has at least m — 1 real zeros. Is the converse 
of this statement true? Prove your answer. 


. Let f(x) € R[x]. Prove that if Df(x) has n real zeros, then f(x) has at 


most 1 + 1 real zeros. 

Prove Lemma 8.3.10. 

Let g(x) € Q[x]. 

(a) Prove that there exists a nonzero integer s such that the polynomial 
f(x) = sg(x) is a polynomial with integer coefficients. 


8.3 


12. 


13. 


14. 


15. 
16. 
17. 
18. 


19. 
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(b) Prove that c is a zero of g(x) if and only if c is a zero of f(x). 


Find the rational zeros, if any, of the following polynomials and then 
find their irreducible factors over Q, R, and C respectively. 

(a) x — x? — 3x +2 

(b) x* — 6x3 + 7x? — 12x + 10 

(c) 12x? = 8x? — 3x4 2 


Use the method of Example 5 to prove the following. 

(a) Prove that V3 is irrational. 

(b) Prove that V2 is irrational. 

(c) Prove that W/2 is irrational for every n € Z, n = 2. 

(d) Prove that if p is a prime number, then Wp is irrational for every 
nEZ,n=2. 


Let f(x) = x3 — 3x + 5. Show that f(x) has one irrational real zero and 
two complex nonreal zeros. 


Prove Corollary 8.3.12. 
State the converse of Theorem 8.3.13. Prove or disprove. 
Prove Lemma 8.3.16. 


Prove that the following polynomials are irreducible over Q. 

(a) f(x) = 4x5 — 7x3 + 21x? + 28 

(b) f(x) = x4 + 7x3 — 9x? + 3x - 1 

(c) f(x) = x3 + 4x? — 9x + 96 

Let f(x) = x* +1. 

(a) Prove that f(x) is irreducible over Q. 

(b) Prove that f,(x) is not irreducible over Z, for p = 2, 3, 5, and 7. 
(c) Factor f(x) in R[x]. 

(d) Find all the zeros of f(x) in C. 

(e) Use part (c) and the method of partial fractions to compute the 


indefinite integral i; aan. 


Discussion and Discovery Exercises 


D1. 


D2. 


Look up the proof of Lemma 8.3.9 in a calculus text and write out 
the proof. 


One consequence of Galois’s theory of equations is the following theo- 
rem: Let f(x) © Q[x] be irreducible over Q and have degree n where 
n is a prime number greater than 3. Suppose that f(x) has exactly two 
nonreal zeros in C. Then the equation f(x) = 0 is not solvable by 
radicals. 

Use this theorem to show that the polynomial equation x° — 20x + 
5 = 0 is not solvable by radicals. 
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D3. 


D4. 


DS. 


In this problem we will see how to express the roots of the equation 

x° — 1 = 0 as radicals. The roots of this equation are the fifth roots of 

unity: 1, 2, £7, &, £4, where £ = cos = + isin = 

(a) Prove that £* is the complex conjugate of ¢. 

(b) Let a = £ + £*. Prove that a is a zero of the polynomial e+x-. 

-1+ 

(c) Prove that cos 2 = af 

(d) Prove that x° — 1 = 0 is solvable by radicals. 

(e) Use your answer to (c) to construct a regular pentagon inside a 
circle of radius 1 using only a compass and straightedge. 


A complex number a is called an algebraic number if a is a zero of 

a polynomial f(x) in Q[x], where f(x) is not the zero polynomial. 

For instance, 2 and i are complex numbers since they are zeros of 

the polynomials x? — 2 and x? + 1, respectively. On the other hand, 7 

is not an algebraic number, although this fact is not easy to prove. 

Let a be an algebraic number. Let S, = {f(x) © Q[x] | f(a) = 0}; 
that is, S, is the set of all polynomials with rational coefficients that 
have a as a zero. (Note that the zero polynomial is an element of S,.) 

Let P(x) be the monic polynomial of smallest degree in S,. Then 
P,(x) is called the minimal polynomial of a. 

The polynomial x? — 2 is the minimal polynomial of V2 since no 
polynomial of smaller degree with rational coefficients has V2 as a 
zero. V2 is a zero of the polynomial x — V2 but this polynomial does 
not have rational coefficients. Similarly, the minimal polynomial of i is 
x +1. 

(a) Explain why the minimal polynomial of the algebraic number a has 
to exist. 

(b) Prove that the elements of S, are precisely the polynomials that 
have P,(x) as a factor; that is, if f(x) € S,, then P,(x) divides f(x). 
(Hint: Use the Division Algorithm.) 

(c) Prove that P,(x) is irreducible. 

(d) Prove that P,(x) is the only monic polynomial of smallest degree 
in S,. 

(e) Prove that if g(x) is a monic irreducible polynomial in Q[x] and 
g(a) = 0, then g(x) = P,(x). 


Using the notation of the previous problem, find P,(x) if 


(a) a = V3 


(b) a = W5. 
(c) a= V2. 


(d) a= V24+V3. 


ANSWERS AND HINTS 
TO. SELECTED 
EXERCISES 


Exercises I|.1 


1. (a) statement 
(d) neither 
(g) statement 
(j) statement 


2. (a) This statement has a universal quantifier: for all rectangles T, the area 
of T is its length times its width. 

(d) This statement has a universal quantifier: for all integers n and m, if 
n is even and m is odd, then n + m is even. 

(g) This statement has a universal quantifier: for all real-valued functions 
f, if f is differentiable on [a, b], then f is continuous on [a, 5]. 

(i) This statement has a universal quantifier and an existential quantifier: 
for every positive real number x, there is a real number y such that 
y=Vx. 

3. (a) There is a rectangle T such that the area of T is not its length times 
its width. 

(d) There is an even integer n and an odd integer m such that n + mis odd. 

(g) There is a real-valued function f that is differentiable on [a, b], but 
is not continuous on [a, Db]. 

(i) There is a positive real number x that has no square root. Or: there 
is a positive real number x such that for all real numbers y, y 4 Vx. 


5. (a) For every real number x, x? + x + 1 #0. 
(d) There is a polynomial function f that is not continuous at some point. 


7. (a) For all integers m and m, if n and m are even, then + mi is divisible 
by 4. 
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9. (a) A real-valued function f is not increasing on [a, b] if there exist x,, 


xX, © [a, b] such that x, < x, and f(x:) = f(x). 


Exercises |.2 


2. (a) P Q (=P) v @ 
T T T 
T F F 
F T T 
F F T 
(c) P Q —IGP) - Q) 
T T T 
T F T 
F T F 
F F T 
4. (a) Either August is sometimes not a hot month or September is 
never cool. 
(c) All cars are either uncomfortable or expensive. 
5. (a) There exist real numbers x and y such that xy = 0 but x ¥ 0 and 
y #0. 
(d) For every rational number r, either r = 1 or r = 2. 
7. (a) For all integers n and m, if nm is even, then n is even and m is even. 
10. (a) P My) PAQ OnP 
T T T T 
T F F F 
F T F F 
F F F F 
11. (a) S and T are equivalent statements. We will show that S and T are 


either both true or both false. Suppose that S$ is a true statement. 
Then if a is an assigned value of the variable x, P(a) ~ Q(a) is true. 
Hence P(a) is true and Q(a) is true. It follows that Vx, P(x) is a 
true statement and Vx, Q(x) is a true statement. Therefore, (Vx, 
P(x)) a (Wx, Q(x)) is a true statement. 

Now suppose that T is true. Then Vx, P(x) is true and Vx, O(x) 
is true. Let a be an assigned value of the variable x. Then P(a) is 
true and Q(a) is true. Thus P(a) A Q(a) is true. It follows that S is 
a true statement. 


ANSWERS AND HINTS TO SELECTED EXERCISES 297 


12. (a) 


15. (a) 


16. (a) 


P Q R (P 4 =Q) v —(R) 

T se T F 

T T F T 

T F T T 

T F F T 

F T T F 

F T F T 

F F T F 

F F F T 

By Exercise 13(c), P v [(Q A R) ~ S)] is equivalent to (P v (Q a 


R)) A (Pv S), which in turn is equivalent to ((P v Q) ~A(Pv R)) A 
(P v S). By Exercise 13(b), this last statement form is equivalent to 
(Pv Q)a (PV R)a(P v5). 

If P is true, then either P ~ Q or P ~ —Q is true depending on 
whether or not Q is true or false. Hence, if P is true, the statement 
form (P 4 Q) v (PA =Q) v (AP 4 Q) v (AP A AQ) is true. On the 
other hand, if P is false, then either —P ~ Q or —P a —Q is true, 
again depending on the truth or falsity of Q. Thus if P is false, 
(P a Q) v (P a -=Q) v (AP A Q) v (AP «A =Q) is true. It follows 
that (P ~ Q) v (P a-=Q) v (AP 4 Q) v (=P a =Q) is a tautology. 


Exercises 1.3 


1. (a) 


3. (a) 


5. (a) 
(c) 


P: For every polygon T, if T is a hexagon, then T has six sides. 
=P: There is a hexagon T such that T does not have six sides. 
P: For every real number x, if x is positive, then there is a real 
number y such that y = Vx. 
=P: There is a real number x such that x is positive and for all 
real numbers y, y # Vx. 


P Ma) (P => Q) PAO 
T T F F 
T F T T 
F T F F 
F F F F 


For every integer n, if n is odd, then n° is odd. 

The original statement is true. Let n be an odd integer. Then we 
can write n = 2t + 1 for some t € Z. Then n? = (2¢ + 1) = 8 + 
12f + 6t + 1 = 2(4f + 6f? + 3t) + 1, so that n? is odd. 
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7. 


11. 


12. 


17. 


(a) For all integers x, y, and z, if x, y, and z are consecutive integers, 
then x? + y? + z’ is even. 
(c) The original statement is false since 2? + 3° + 4? = 29, and 29 is odd. 


Suppose that [Vx(P(x) > Q(x))] 4 [Vx(Q(x) => R(x))] is a true statement. 
Then the statements Vx(P(x) = Q(x)) and Vx(Q(x) = R(x)) are both 
true statements. Let a be an assigned value of the variable x. Then the 
statements P(a) > Q(a) and Q(a) => R(a) are both true. By Exercise 
10, the statement P(a) > R(a) is true. It now follows that the statement 
Vx(P(x) => R(x)) is true. 


We compare truth tables. 

P Q R P=>(QvR) (P~’=Q)=>R 
T T T T T 

T T F T T 

T F T T T 

T F F F F 

F T T T T 

F T F T T 

F F T T T 

F F F T T 


(a) The statement ‘“‘P is a sufficient condition for Q”’ is the statement 
“Tf n is a multiple of 4, then n’ is a multiple of 4.” This is a true 
statement. If 7 is a multiple of 4, then we can write n = 4t for some 
integer t. Then n* = (41) = 4(4t7), proving that n’ is a multiple of 4. 

(d) The statement “‘Q is anecessary condition for P” is logically equiva- 
lent to the statement “‘P is a sufficient condition for Q”’ and therefore 
by part (a) it must be a true statement. 


Exercises 1.4 


1 


4. 


6. 


P Q =0 > -P P>@ 

T T T T 

T F F F 

F T T T 

F F iT T 

(a) Contrapositive: If the food is not spoiled, then the power is on. 


Converse: If the food is spoiled, then the power went off. 


(a) Because n and m are even, we can write n = 2t and m = 2s for some 
ts € Z. Thenn + m= 2(t + s). Therefore, n + m is even. 
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8. Jonathan cannot conclude anything. 


12. (a) 


(b 


a 


15. (a) 


22. (a) 


The statement Vn, m € Z, P(n, m) > Q(n, m) is true. Let n, m € 
Z and suppose n and m are odd. Then there exist integers s and ¢ 
such that n = 2s + 1 andm = 2t + 1. Thus nm = (2s + 1)(2t+ 1) = 
4st + 2t + 2s + 1 = 2(2st + t + s) + 1. Since 2st + t + s is an 
integer, it follows that nm is odd. 

The statement Wn, m € Z, Q(n, m) => P(n, m) is true. Prove it by 
proving its contrapositive. 


This statement is false. The negation is the statement ‘‘There exists 
an integer m such that n is even but Vn is not an even integer.” 
Since the integer 2 is even and V2 is not an even integer because 
it is not an integer, the negation is true. 


We first suppose that P and Q are logically equivalent. We must 
prove that the statement P © Q is always true. If P is true, then Q 
is true, since P and Q are logically equivalent. Hence from its truth 
table P © Q is true. On the other hand, if P is false, then Q is false 
and therefore P © Q is again true by its truth table. It now follows 
that P = Qs a tautology. 

Now suppose that P © Q is a tautology. Then P © Q is never 
false. Now P © Q is false precisely when either P is true and Q is 
false or P is false and Q is true. Thus P and Q are either both true 
or both false and hence are logically equivalent. 


Exercises 2.1 


1. (a) 
(d) 
(g) 


2. (a) 
(d) 
(g) 


4. (a) 
(d) 


(g) 


5. (a) 
7. (a) 


8. (a) 
(e) 


{15, 16, 17, 18, 19, 20, 21} 
{i, 4a, 9m} 
{2, 5, 13, 17, 29, 37, 41, 53, 61, 73} 


{11, 12, 13, 14, 15, ...} 
{3, 7, 11, 15, 19, 23, ...} 
{0, 7, —17, 27, —27, 37, —37,.. .} 


(EZ x <6} 
{x € Z|x = 4t for some t € Z} 


re 
2 


{x € Z| x = 5t for some t € Z} 


[=1s)° @)(=e,=)) We) @ 


{x ER| -4 <x < 4} 
{x € R| x > 16 or x < —2} 


nezhor{rer x=n¥ forsomene Z} 
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10. 


17. 


18. 


19. 


20. 


21. 


. (a) For all integers n, if n is a multiple of 9, then 1 is a multiple of 3. 


(d) Negation: There is an integer n such that n is a multiple of 9 andn 
is not a multiple of 3. 

(f) Converse: For all integers n, ifn is a multiple of 3, then n is a multiple 
of 9. The converse is false because 3 is a multiple of 3 and not a 
multiple of 9. 


(a) Letx € A. Then x = ns for some s € Z. Since n is a multiple of m, 
there exists t € Z such that n = mm. Then x = ns = tms = m(ts) © 
B. Thus A C B. 


(a) Let A be the set of all hexagons and B the set of all polygons with 
six sides. U is the set of all curves in the plane. 

(d) Let A = {x € Z| x = 6¢ for some integer ¢} and B = {x € Z| x is 
even}. U = Z. 

(a) If an integer n is not a multiple of 3, then n is not a multiple of 9. 

(c) If a polygon does not have six sides, then it is not a hexagon. 

(f) For all integers x, if x is odd, then x ¥ 6t for all integers ¢. 

(a) {« € Z|x <5} 

(d) {x ER|x Ss —20rx>4} 

(a) A — B = {1, 2, 3, 4, 5, 8, 10} 
B-—A = {11, 12} 

(e) A-—B={x ER|x = 10} 
B-A={ ER|x=4} 

Only (b) and (d) are equivalent to A C B. 


Exercises 2.2 


13. 


(a) A U B = (0, 1, 2, 3, 4, 5, 8} 

(d) A — B = {2, 3, 5} 

(g) AN (BUCO = {1,2, 4, 5} 

(i) (AUB)N(AU © = {1, 2, 3, 4, 5} 
. (a) A = {0, 6, 7, 8,..., 20} 

(d) A U B = {0, 2, 3, 5, 6,... , 20} 
(a) (0, 4] 

(d) [2, 4] 

: (a) (-~, 1) U (4, oo) 


(d) (0, 1) 


. (a) Hint: Let A = (a, b) and B = (c, d) where a = c. Show that c < b 


and then prove that A U B = (a, b) or (a, d). 


. (a) AU B= {n € Z| nis even orn is a multiple of 5} 


(d) B-— A = {n € Z| nis even but n is not a multiple of 10} 
(a) Letx € AN B. Then x € A and x € B. Since x € A, x = 10 and 
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14. (a) 


16. (a) 


22. (a) 


24. (a) 


since x € B, x > 5. So, 5 < x S 10. It follows that x € (5, 10]. 
Therefore, A M B C (5, 10]. 

Conversely, let x € (5, 10]. Then 5 < x < 10, implying that x = 
10 and x > 5. Hence x € A and x € B or in other words, x €E AN 
B. Thus (5, 10] C AN B. It now follows that A N B = (5, 10]. 


Letx € AN (A U B). Thenx € A andx € A U B. In particular, 
x € A, so AN (A U B) CA. Conversely, let x € A. Then x € 
A UB by part 5 of Proposition 2.2.2. Since x is also in A, we have 
that x € AM (A U B). Therefore, A CAN (A U B). It follows 
that AN (A U B) =A. 


We want to prove that if A is a subset of C and B is a subset of C, 
then A U B is a subset of C. Letx € AU B. Thenx € Aorxe€ 
B. If x € A, then x € C since A is a subset of C. If x € B, then 
x € C since B is also a subset of C. In either case, x is an element 
of C. Therefore, A UBCC. 
(AUB)NA=AN(AUB) 

= (ANA) U(ANB) 

=@U(ANB) 

=ANB 

= B — A by Example 5. 


First suppose that A = A — B. We want to prove that A MN B = ©. 
Suppose, on the contrary, that A 1 B #¥ ©. Then there exists x € 
A B. Since x € AN B,x € A andx € B. But if x € A, then 
x € A — Bsince A = A — B. This implies that x € B. Now we 
have the contradiction that x € B and x € B. Therefore, we can 
conclude that AN B = ©. 

Conversely, suppose that A M B = ©. We want to show that 
A =A — B. Any element of A — B is an element of A, by the 
definition of A — B. SoA — B CA. Now let x € A. Then x € B, 
otherwise x would be in A 1 B, contradicting the fact that A M 
B= ©. Thus x € A — B. This proves that A C A — B. Therefore, 
A=A-B. 


27. A X B = {(-1, 7), (-1, 12), (-1, 19), (-1, 21), (0, 7), (0, 12), (0, 19), 
(0, 21)}. 


28. Hint: Assume that r + x is rational and derive a contradiction. 


Exercises 2.3 


1. (a) 


P(A) = {©, {4}, {7}, {10}, {4, 7h, 14, 10}, {7, 10}, {4, 7, 10}} 


3. P(P(A)) = { ©, {Oh}, {{O}, {13}, (10, 1}, {S, 10}, {, {13}, {O, (0, Ut, HO}, 


{1}, {{0}, (0, 1}, {1}, {0, 13}, 1S, {0}, {1}, {©, {0}, {0, 13}, 1S, 
{1}, {0, 13}, {0}, {1}, (0, 13}, 1S, {0}, (13, {0, 1} } 
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. (a) false (d) true (g) true 


- YA = 01,2,3,...,404+ 1 


UA = [1 #), A= 2 


i=l i=1 


UA = (2,04 = 0 


. One partition of Z is {E, O}, where E and O are the sets of even and 


odd integers, respectively. A partition of R is {x E R|x > 1}, {fe ER 
|x < 1}, {1}. 


. (a) 1/13 
28. 


(a) 3/5 is not in C. 


Exercises 3.| 


3. 


4. 


5. 


(a) Im f = O, the set of odd integers 
f(X) = {4t + 1 | t+ © Z} 

(d) Im f = [-1, 1] 
f((0,21) = [0, 1] 

(g) Imf=Z 
f(EX Z)=E 


(a) not a function 
(c) a function. Im f = {x € R| x = 0} 
(e) a function. Im f=R 


(a) By definition, Im f C R, so we only have to show that R C Im f. 
Let y € R. Then f=) = 6(*) + 5 = y. Thus y € Imf 


8. (a) We will prove that f([0, 1]) = [0, 1]. If 0 = x <1, then multiplying 


these inequalities by x gives 0 = x? = x = 1, so that f(x) € [0, 1]. 
Hence f([0, 1]) C [0, 1]. Conversely, suppose that y € [0, 1]. Then 
0<y <1. Let x = Vy. Then y = f(x) and x = 0. If x > 1, then 
x? = xx > lx > 1. But y = x? = 1. Therefore, x € [0, 1] and we 
have shown that y € f([0, 1]). Hence [0, 1] C f({0, 1]). Therefore 


F({0, 1]) = [0, 1]. 


10. (a) Letn € Z. If n is even, then f(n) = n + 2 is even. If n is odd, then 


f(n) = 2n is even also. Therefore Im f C E. Now let y € E. Then 
y — 2 is even and f(y — 2) = y —2 + 2 = y, proving that y € 
Im f. Thus E C Im f. We can conclude that Im f = E. 
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12. (a) Imf=R 
(d) f((-1, 2]) = [-6, 14] 

17. (a) f (B) =A 
(d) f"Gd}) = © 

18. (a) f '(Wi) = O f (Way =A f'(Ws) = {0} 
Dif (Wi) = 2 f"(W,) =R FW) S01 
(g) fF '(W) = {0 f-'\(W2) = © 

19. (a) f-'({11, 12, 13, 14, 15}) = {5, 22, 24, 26, 28, 30} 

21. (a) By the definition of inverse image, f '(f(X)) = {a € A | f(a) € 
f(X)}. So if a € X, then f(a) € f(X), proving that a € f-'(f(X)). Hence 
XC fF(X)). 


Exercises 3.2 


1. (a) fis surjective. By definition, Im f C R, so we only have to show that 
RC Im f Let y € R. Then f(*) = 2(+) + 1 = y. Thus y € Im f. 
(d) fis surjective. Let y © R*. Let x = i. Then f@) == a ye SO 
ye Ime. 
(g) fis surjective. Let y € R*. Let x = In y. Note that x exists since 
y > 0. Then f(x) = e”” = y. Therefore, y € Imf. 
(j) fis not surjective. For example, (0, 1) is not in Im f. 


2. (a) fis not surjective because f(n) is odd for all n in Z. 
(d) fis surjective. To see this, let x € Z. Let n = 2x + 1. Then n is odd 
so f(n) = f(2x + 1) = +44 = x. So x is in the image of f. 


12. (a) f is injective. Suppose that x and y € R and f(x) = f(y). Then 
2x + 1 =2y + 1. It follows that 2x = 2y and therefore x = y. 
(d) fis injective. Suppose that x, y © R* and f(x) = f(y). Then + = 
and thus x = y. 
(g) fis not injective since f(1, —1) = f(2, —2) = 0. 


i 
y 


13. (b) f is injective. To prove this, suppose that f(n) = f(m) where 
n,m © Z. If n, m are both even, then we get immediately that 
n = m. If n and m are both odd, then we get 2n — 1 = 2m — 1, 
which implies that n = m. If n is even and m is odd, we get 
n = 2m — 1. But this cannot happen since n is even and 2m — 1 
is odd. Therefore, this case cannot occur. For the cases that can 
occur, we have shown that whenever f(n) = f(m), then n = m. 
So f is injective. 

(d) fis not injective since f(2) = f(5) = 2. 


24. (a) f: R — R defined by f(x) = x. 
f: R — R defined by f(x) = x°. 
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x?, ifx=0 


f:R— R defined by f(x) = { 
—x?, ifx <0. 


Exercises 3.3 


1. 


2 


10. 


11. 


13. 


(a) fe(x) = sin?’ x + 1 
ef(x) = sin(x? + 1) 

(a) Let n € Z. If n is even, (fg)(n) = f(g(n)) = f(n) = 2n. If n is odd, 
(fg)(n) = f(g(n)) = fn — 1) = 2(2n — 1) = 4n — 2. 
For any n € Z, (gf)(n) = g(f(n)) = g(2n) = 2n. 


. (a) g, h © F(R), g(x) = x®, h(x) = 3x? - 11 


(d) g: RR, g(x) = x — 1h: {0, 1} > R, A(O) = 2, A(1) = 1 


. (a) Hint. Since your objective is to show that f is injective, you should 


start the proof by letting a, b, © A and assume that f(a) = f(b). Then 
prove that a = b. To do that you will have to use the fact that gf is 
injective in some way. 


(a) Not invertible. fis not injective. 

(d) Invertible. f(x) = +2 

(g) Not invertible. fis not injective and not surjective. 
(j) Invertible. f(x) = In x 


(a) Let x, y € R, x 41, and y ¥ 1, and suppose that f(x) = f(y). Then 
= Se ee -Y)OtD = et Dy Daya y tee i= 


xyt+ty-x-l,-yt+tx=y-x,2x =2yx=y. 


(a) The six permutations are defined by 
fila) = a, fi(b) = 6, file) =¢ fila) = b, f(b) = c, fic) = a 
f(a) = a, fi(b) = ¢, fic) = b f(a) = ¢, f(b) = a, falc) = b 
fila) = b, filb) = a, filc) = ¢ fla) = c, fo(b) = b, fo(c) = a 


Exercises 4.| 


1. (a) Leta b EC R — {0}. Then a + b = ¢ is a real number since b ¥ 0 


and ¢ # 0 since a # 0. Therefore, a + b © R — {0}, proving that 
division is a binary operation on R — {0}. 


2. (a) i. Letn, m, p © Z. Then 


(n*xm)*p=(ntm+2)*p=n+mt+2+p+2=n+m 

+pt+4 

n*e(m* p)=n*¥(m+pt+2)=nt+m+pt2+2=n+m 

+pt+4. 

Therefore, * is associative. 

ii. Letn andm € Z. Thenn*m=n+m+2=m+n4+2= 
m * n. Therefore, * is commutative. 
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10. 


11. 


16. 


19. 


22. 


235 


(d) i. Letn, m p © Z. Then 


n *(m * p) = max(n, max(m, p)) = max(n, m, p) 
= max(max(n, m), p) = (n * m) * p. 

Therefore, * is associative. 

ii. Let n and m € Z. Then n * m = max(n, m) = max(m n) = 
m * n. Therefore, * is commutative. 

(g) i (2 * 3) *4 = 2% 4 = 64% 4 = 2°92 * (3 * 4) = 2% (2") =2% 

4096 = 28!” Therefore, * is not associative. 

li. Ifx, y ER, thenx * y = 2” = 2” = y «x. Therefore, * is commu- 
tative. 


- (a) Let f, g © F(R). (f + g)(x) = f(x) + (x) = a(x) + f(x) = & + 


f)(x) for all x in R. Therefore, f + g = g + f and so addition on 
F(R) is commutative. 


. (a) Letn © Z. Then n * (—2) = (—2) *n =n + (—2) + 2 =n. Therefore 


—2 is the identity element. 

(d) Ifm € Z, thenn * 1 = 1 *m = max(n, 1) =n. Therefore, 1 is the 
identity w.r.t. *. 

(g) Suppose there exists an identity element e w.r.t. *. Then e * 
(-1) = 2°¢ = —1. But this cannot happen since every power of 2 is 
positive. Thus there is no identity. 

(a) Letn € Z*. Then n * (—n — 4) = (-—n — 4) *n=n+ (-n- 4) + 
2 = —2. Thus —n — 4 is the inverse of n w.r.t. *. 

(d) Letn € Z*. Suppose that n has an inverse x wr.t. *. Then n * x = 
max(n, x) = 1. Thus n = x = 1. Therefore, only 1 has an inverse 
w.r.t. *. 


a b\/(x y 1 0 
Hint. Consider the matrix equation ( ( = ( . Solve for 
c d/\z w 0 1 


x, y, z, and w in terms of a, b, c, and d. You will see that a solution 
exists if and only if ad — bc # 0. Check that your answer gives a solution 


x y\/a b 1 0 
to ( )( )=( )as well. 
z w/c a 0 1 


(a) Let a, b, © E. Then a = 2x and b = 2y for some x, y € Z. Then 
ab = (2x)(2y) = 2(2xy) € E. Therefore, E is closed under multipli- 
cation. 

(a) Let a, b, © E. Then a = 2x and b = 2y for some x, y € Z. Then 
a* b = (2x) * (2y) = 2x + 4y = 2(x + 2y) © E. Therefore E is 
closed under *. 

(a) {f € F(A) | fis injective} is closed under composition by part 2 of 
Proposition 3.3.4. 

(a) Let S = {f € F(R) | (0) = O}. Let f g © S. Then (f + g)(0) = f(0) + 
g(0) = 0 + 0 = O. Thus f + g € S, so S is closed under addition. 
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(d) Let S = {f © F(R) | f(—x) = f(x) for all x € R}. Let f g © S. Then 
(f + g)(—x) = f(—x) + g(—x) = f(%) + a(x) = (Ff + g)(x) for all 
x ER. Hence f + g € S. Therefore S is closed under addition. 
25. (a) Let S = {f € F(R) | f(0) = O}. Let f g © S. Then fg(0) = 
f(g(0)) = f(0) = 0. Therefore, fg € S, proving that S is closed 
under composition. 
(d) Let S = {f © F(R) | f(—x) = f(x) for all x © R}. Let f g © S. Then 


fg(—x) = f(g(—x)) = f(g(x)) = fg(x). Thus fg € S, so S is closed 
under composition. 


26. (a) Let S = {(° ’) € M,(R) | b= 1} Let (‘ ’) and (* ) ES. 


c a Zw 
a b x y atx bty 
So b = y = 0. Then )+ )=( )- 
c d Zw c+z dt+w 
at+x 0 
( ) € S. So S is closed under addition. 
ct+z dt+w 


36. Hint: Exercise 11 will help. 


Exercises 4.2 


1. (a) ORO, so R is not reflexive. 
2R1 but 1R2. Therefore R is not symmetric. 
Let a, b, c € Z and suppose aRb and bRc. Thena > b andb > c, 
so a > c. Hence aRb. Therefore, R is transitive. 
R is antisymmetric for the same reason that the relation in Example 
10 is antisymmetric. 


3. (a) R is not reflexive since no set is a proper subset of itself. 

If X C Y, then X is a proper subset of Y and therefore Y is not a 
proper subset of X. Thus R is not symmetric. 
R is transitive by a proof similar to the proof of Proposition 2.1.2. 
Ris antisymmetric since the statement X¥ C Yand Y C Xis never true. 

(d) XRX for all X in P(A) since |X| = |X|. Thus R is reflexive. 
If |X| = |Y|, then |Y| = |X|. Therefore if XRY, then YRX. Hence 
R is symmetric. 
If |X| = |¥| and |Y| = |W], then |X| = |W]. Thus if XRY and YRW, 
then XRW. Therefore R is transitive. 
R is not antisymmetric. Let A = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}, X = {1, 
2, 3}, Y = {2, 3, 4}. Then XRY and YRX, but X # Y. 


4. (a) If f © F(R), then f(0) = f(0), so fRf. Therefore, R is reflexive. 
Let f. g © F(R). Suppose that fRg. Then f(0) = g(0) and so g(0) = 
f(0). Hence gRf. Therefore, R is symmetric. 
Let fg, h © F(R). Suppose that fRg and gRh. Then f(0) = g(0) and 
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5. (a) 


12. (a) 


. (a 


—m 


g(0) = h(0). It follows that f(0) = h(0) and therefore fRh. Hence 
R is transitive. Therefore, R is an equivalence relation. 

If f € F(R), then the equivalence class of fis the set of all functions 
that intersect f at the origin. 


[18] = {x € Z| x = 4t + 2 for some t € Z} = {2, —2, 6, —6, 10, 
—10,...}. 


Let a € Z*. Then aka since a = 1a. Therefore, R is reflexive. 

Let a, b, c € Z*. Suppose that aRb and bRc. Then b = na and c = 
mb for some n, m € Z*. Hence c = mb = m(na) = (mn)a, so aRc. 
Thus R is transitive. 

Let a, b € Z*. Suppose that aRb and bRa. Then b = na anda = 
mb for some n, m € Z*. We get b = na =n(mb) = (nm)b. Therefore, 
nm = 1 and since n and m are positive integers, it follows that n = 
m = 1. Therefore, a = b, proving that R is antisymmetric. 


Let f © F(R). Clearly, f(x) = f(x) for all x € R, so fRf Therefore, 
R is reflexive. 

Let f, g, h € F(R) and suppose that fRg and gRh. Then f(x) S g(x) 
and g(x) = A(x) for all x € R. Thus f(x) = h(x) for all x € R, fRhA. 
Therefore, R is transitive. 

Let f, g © F(R) and suppose that fRg and gRf. Then f(x) = g(x) 
and g(x) = f(x) for all x © R. Thus f(x) = g(x) for all x © R. So 
f = g. Hence R is antisymmetric. 


- (a) ([a, 6] + [eo d)) + [ef] 


= [ad + be, bd] + [e, ff] = [(ad + bc)f + (bd)e, (bd)f] 
= [adf + bcf + bde, bdf] = [a(df) + b(cf + de), b(df)] 
= [a, b] + [cf + de, df] = [a, b] + ([c, a] + [e, f]) 


. (a) (0, x)R(O, y) since 0: y = x- 0 = 0. Therefore, [0, x] = [0, y]. 
. (a) [a b] + [0, 1] = [a-1 + 5-0, 5-1] = [a D] for all [a, b] E QD 


Therefore, [0, 1] is the additive identity of Q. 


Exercises 5.1 


1. Leta, b,c € Z. 


(a) Since a + (—a) = —a + a = 0, it follows that a is the additive inverse 


of —a; in other words, —(—a) = a. 


(d) (-1)a + a = (—1)a + la by Axiom 7 


= ((-1) + 1)a by Axiom 8 

= 0a = 0 by part 1 of Proposition 5.1.1. 
Therefore, (—1)a is the additive inverse of a, implying that 
(-1)a = —a. 


5. (a) Suppose that a < 0 and b < 0. Then —a > 0 and —b > 0 by part 2 


of Proposition 5.1.4. Hence (—a)(—b) > 0 by part 3 of Proposition 
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5.1.4. But, by part 4 of Proposition 5.1.1, (—a)(—b) = ab. Therefore, 
ab > 0. 

(d) Suppose a < b andc > 0. Then b — a € Z* andc € Z*. By Axiom 
9, (b — a)c € Z*. But (b — a)c = be — ac by Axiom 8. Therefore, 
bc — ac € Z, so that ac < be. 


7. Hint: Prove the contrapositive. 
11. (a) Hint: Assume that b ¢ Z* and derive a contradiction. 


19. (a) Hint: Use the fact that if x © Z”, then —x € Z* and 1 is the smallest 
element of Z*. 


Exercises 5.2 


1. (a) Let P(n) be the statement “1 +3 +5 +... + (Qn - 1) =n.” 
Since 1 = 12, P(1) is true. Suppose now that P(k) is true. Then 1 + 
34+5+...+ (2k —1) =k’. This is the induction hypothesis. Hence 


1+34+5+...+ (2k —-1) + (%(k +1) -1) 
=h+2k+1=(k+1). 


Thus P(k + 1) is true. Therefore, by induction, P() is true for all 
integers n. 
4. (a) Let P(n) be the statement: “2” > n’.”’ Since 2° > 5’, P(5) is true. 

Suppose that P(k) is true. Then 2‘ > k’. It follows that 2'*! = 

2° 2* > 2k? > (k + 1). The last inequality is true since 2k? — 

(A+ 1~p=kh-2k-1>kh - 2k —-3 = (k —- 3)(kK +1) >0 

since k > 4. This proves that P(k + 1) is true. So by induction, 

P(n) is true for all positive integers n = S. 


& (aA HPLA=HA1A=2,f=3,6 =5,f6 = 8 ff = 13, fe = 21, fo = 34, 
fio = 55 
(d) Let P(n) be the statement “fj + f3+...+ f%=fifier.” Since ft = 
fif: = 1, P(1) is true. Now suppose that P(x) is true for a positive 
integer k. Then f7+ f3+...+ f%=fifisi. Therefore, 


Fit fot... + fit fla = fiefs + fea = foie + fest) = fari fara 


This proves that P(k + 1) is true and so by induction, P(n) is true 
for all positive integers n. 
9. (a) A, = 1, H; = 3, H, = 6, Hs = 10, Hy = 15 

16. (a) The basis for this proof is Proposition 3.3.4, which says that the 
composition of two surjective functions is surjective. Let P(n) be 
the statement “If fi, fi,...,f, are surjective functions in F(A), then 
the composition f,f, ... f, is surjective. Clearly P(1) is true. Suppose 
that P(k) is true for a positive integer k. Let fi, fi, ... , fi, 
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25. 
27. 


29. 


30. 


31 


fir: © F(A) be surjective functions. By the induction hypothesis, 
the composition fif, ... f; is surjective. By Proposition 3.3.4, the 


composition (fifi. ... fi) fis is surjective. But (fifi ... fife = 
fifi .-- fifi: Since composition is an associative binary opera- 
tion on F(A). Therefore, P(k + 1) is true. Thus P(n) is true for 
all positive integers n. 


Hint: Use the previous problem. 
B; = 0, Bs = 1/42, B, = 0, Bs = —1/30 


+2434... + (= 1S =F nil — 1)°Qn* — 2n - 1) 


(a) Hint: Apply the Binomial Theorem to (€ + 1)**1. 
(d) S4(n) = 5° — tn + bn) — aon 


6 
MOR Se 


36. 


(a) f(0) = fC + 0) = FO) + FO) = 2f(0). Thus f(0) = 0. 


Exercises 5.3 


1. 


4. 


6. 


10. 


(a) Let xn € Z. By the Division Algorithm, there exists t and r in Z such 
that n = 3t + r, where 0 =r < 3. Son = 3t.n = 3t+ l,orn= 
3t + 2. 


(a) Let P(n) be the statement: 4 | 5" — 1. Since 4| 5 — 1, P(1) is true. 
Suppose that P(k) is true; that is, 4 | 5‘ — 1. Then 5* — 1 = 4¢ for 
some tin Z. Hence 5**! — 1 = 5(5* — 1) + 4 = 5(4t) + 4 = 4(5t + 
1). Therefore 4 | 5**! — 1 and P(k + 1) is true. By induction, P(n) 
is true for all integers n. 


(a) Ifa|b and 5 | a, then there exist c and d in Z such that ac = b and 
bd = a. We get a = bd = (ac)d = a(cd), which implies that cd = 
1. Hence c | 1 and by part 1 of Lemma 5.3.3, c = +1. Therefore, 
a=xb., 

Converse of 1: If a = +1, then a | 1. True: Since (1)(1) = 1 and 
(—1)(-1) = 1, it follows that both 1 and —1 divide 1. 

Converse of 2: If a = +b, then a | b and b | a. True: If a = b, 
then a(1) = b and b(1) = a, which shows that a | b and b | a. If 
a = —b, then a(—1) = b and b(-1) = a, again showing that a | b 
and b | a. 

(a) 1840 = (1518)(1) + 322 

1518 = (322)(4) + 230 

322 = (230)(1) + 92 
230 = (92)(2) + 46 
92 = (46)(2) + 0 


(e 


a 
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11. 
14. 


16. 


20. 


21. 


Therefore, (1840, 1518) = 46. 
46 = (1840)(—14) + (1518)(17) 
Hint: Use the fact that d = ax + by for some x, y € Z. 


Hint: This is a set theoretic proof. Prove that nZ N mZ C nmzZ. (Exercise 
13 will help.) Then prove that nmZ C nZ 0 mZ. 


(a) Hint: Let d = (n,n + 2). Prove that d divides 2 and then show that 
d cannot be 2. 

Hint: First show that if x and y are even, then z is even, a contradiction. 

Then assuming x and y are odd, show that z is even. Writing x as 2t + 


1 and y as 2s + 1 for some integers ¢ and s, derive the contradiction that 
4 divides 2. 


Hint: Assume that x and y are not divisible by 3 and use Exercise 1(a) 
to derive a contradiction similar to the one for Exercise 20. 


Exercises 5.4 


. Hint: Prove the contrapositive. 


4. Hint: Use induction on the integer m. 


7. Hint: Assume that log; 10 is rational. Use the definition of logarithm to 


derive an equation that contradicts Theorem 5.4.7. 


. (a) 594 =2x 3>X 11 


(d) 191,737 = 2 x 13 x 43 


. Hint: Use Theorem 5.4.7. 
. Hint: Write m as a(2b + 1) where a is a power of 2. (Explain why you 


can do this.) Then show that 2* + 1 is a factor of 2” + 1 to conclude 
that b = 0. 


. Hint: Show that if n is not prime, then 2” — 1 can be factored. 
. Hint: Prove the contrapositive. 


. Hint: Use Theorem 5.4.7. 


Exercises 5.5 


1. 


3. 


(a) Ifx is even, then x = 2k for some integer k. Then x” = 4k? = 0(mod 
4) since 4 divides 4k’. 
Alternative proof: if x is even then x = 0 or 2(mod 4). Then 
x? = 0 = O(mod 4) or x? = 2? = 4 = 0(mod 4). 
(a) Suppose that there exist integers x and y such that x? = 8y + 3. 
Then x? = 3(mod 8). But by Exercise 2, x? = 0, 1 or 4(mod 8). 
Therefore no such integers x and y exist. 
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ioe) 


. (a) [12] 

12. (a) Let [a], [b], [c] © Z,. Then ({a] + [b]) + [c] = [a + 6] + [c] = 
[(a + b) + c] = [a + (6 + c)] = [a] + [b + c] = [a] + 
([b] + [c]). Therefore, addition is associative. 

15. (a) [4] 

(d) [8] 

16. (a) 21 

17. (a) [15] 

(d) [617] 


18. Addition and multiplication tables for Zs: 


[0] Oo fF 2 BB ff f) ff 7 
(1) oH 2 Bo 4 UO} hf} 6) OO 
[2] 2] Bl 4 GS) ff fT ff) 
[3] 3B} ff) ) 6 7 bb OF EP 
[4] 4] §) 6 7 bb © fF BB) 
[5] 5} 6 7 8 © 2 BB 
[6] 6 7 © fF 2 B) 4) OB) 
[7] 7 8 8 @ BF Fw &) 


oo f BF 8 WW SS) 6) 


[0] [0] [oO] [0] [o} [0] [0 [0] — [0] 
(1) o © 2 BF & F) f 
[2] oO 2) 4) «f))6|6fOl6hlhU]lhU) C6) 
[3] CC) is) 2) 2 Cc) 2 
[4] (0) [4] © 4 & fF © ff) 
[5] Oo § 2 MM 4 © &) Bl 
[6] (] [6 4) 2) (0) [6 [4] fI 
[7] fo) 7) 6) ) 4 2B 2) 


The elements in Z, that have multiplicative inverses are: [1], [3], [5], 
and [7]. 


28. (a) 7 


Exercises 6.1 


6. (a) f: Z => O, f(n) = 2n + 1 


312 ANSWERS AND HINTS TO SELECTED EXERCISES 


( 2n + 100, ifn >0 
d) f:Z—S, f(n) = 
DF I”) —2n + 101, ifn <0 
(g) f: Z—> Z X {0}, f(n) = (a, 0) 
8. (b) Hint: First prove that if x ¢ A, then A U {x} is finite. Then, if 7 is 


the number of elements of B, use induction on nv to prove that A U 
B is finite. 


10. f(10) = 4, f(15) = 1/5 
11. (c) Hint: First prove that the set {e* | x © Z} is countably infinite. 


Exercises 6.2 


1. (a) Hint: Define a linear function from [0, 1] to [a, b] 


5. (a) Reminder: To prove that fis surjective, you need first to show that 
Im f C (0, 1) and then show that (0, 1) C Im f. 

(c) Hint: First define a bijection from R* to (a, ©). 

7. (a) Hint: Show that [0, 1] < [0, 1] U [2, 3] x R and then use Lemma 
6.2.4 and the Schroeder—Bernstein Theorem. 

(d) Hint: First show that there is an injection f: (0,1) X (0, 1) — R. 
To do this, recall that any element of (0, 1) can be represented as 
.4,0)03.... , where a), a), a3, ... are integers between 0 and 9, and 
that every such expansion is unique if we do not use decimal expan- 
sions ending in an infinite string of 9’s. 

(g) Hint: First, define a function 3: F(Z*) — (0, 1) as follows. Denote 
the binary expansion of an integer n by n,. If f € F(Z*), let O(f) = 
0.f(1),2f(2),2f(3),2 ..., interpreted as a decimal expansion. Prove 
that is an injection. 

Second, define a function A: P(Z*) > F(Z") as follows. If A is 
a subset of Z*, then A(A) is the function f: Z* — Z* defined by 


lifnEA 
f(n) = Prove that A is an injection. 
2ifn € A. 


Exercises 6.3 


2. (a) Hint: Assume that (a, b) < (c, d) and (c, d) < (a, b) and derive a 


contradiction. 
4. Hint: Let A: [] A — P(Z*) be the function defined as follows: an element 
i€Z* 
fin [] A is a function f: Z* — {0, 1}, so let A(f) = {n € Z* | f(n) = 
ieZ* 


0} € P(Z*). Prove that A is a bijection. 
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Exercises 7.1 


3. (a) Hint: Use induction on n. 

6. (a) Hint: With m fixed, use induction on n. 
10. (a) Hint: Assume that x"! ¢ P and derive a contradiction. 
15. Hint: First prove that Z* C P. 


17. (a) Hint: Assume that —x € N and derive a contradiction. 


Exercises 7.2 


1. (a) lub S does not exist; glb S = —1 
(b) lub S = 1; glb S = 0 


2. Hint: Consider the set T = {—x | x € S}. 


4. (a) Hint: Use the fact that if a < x < b, then there is a real number y 


such thatx <y <b. 
15. Hint: Use Exercise 12 of Section 7.1. 


Exercises 7.3 


2. (a) 22 - 21i 
(d) —234 — 415i 
(g) 34 8. 
61 61° 


7. (a) 3V2(cos(3a/4) + i sin(37/4)) 
(d) 2(cos(37/2) + i sin(37/2)) 
(g) V26(cos 6 + i sin 6) where 6 = sin“(1/V/26) 


9. (a) 648V3 
(d) 2%(1 + i) 
(g) — 230318 
10. (a) The fourth roots of z are: 


2, 2 


v3 
2 


<= Vi08 (+), a= vi06 (-P+4i), 


3=V 108 ( -1i), au=wv 108 (- 
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12. 


The fifth roots of unity are: 
_,  _V5-1 , Vi0+2V5, 
wy) = 1,0, = —— + ———), 
4 4 
fo eS) OSSD V E25. 
2a ] 
4 8 


jus OVS 1) (VS = VOSS. 
32 =. 4 8 2 


VEST VOR IVS: 
4 4. 


Og 


Exercises 8.] 


. (a) Hint: Prove the contrapositive. 
» (a) f(x) + g(x) = x4 + 9x3 — 6x? — 10x + 7 


- (a) G) fe) + g(x) = [4]x° + [3]? + [6]x + [5] 


(ii) f(x)g(x) = [1]x? + [4]x4 + [2]x? + [2]x? + [2]x + [6] 


- (a) q(x) = x8 +x -—1,r(x) = 9x + 6 
. (a) —1 is the only real zero. 
| {i Wa 
Sie and See 
. (a) 175 5 and a 


. (a) Any integer x such that x = 4 or 7 (mod 11) is a solution. 


. (a) x? = a(mod 7) is solvable for a = 1, 2, 4 and not solvable for a = 


3, 5, 6. 


Exercises 8.2 


2. 


(a) x? — 5 is irreducible over Q. 
x? — 5 = (x — V5)(x + V5) over R and C. 


14. (a) The g.c.d. of f(x) and g(x) is the constant function 1. 


20. 


The only irreducible polynomial of degree 2 in Z,[x] is [1]x? + [1]x + 
[1]. The irreducible polynomials of degree 3 are [1]x? + [1]x? + [1] and 
[1Jx? + [1x + [1]. 
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Exercises 8.3 


4. (a) There are three real zeros: x, ~ —3.423, x, ~ .378, and x; ~ 1.545. 


12. (a) 2 is the only rational zero of f(x). 
f(x) = (x — 2)(x? + x — 1) in Qf]. 
f(x) = (x — 2)(x +: 1/2 — V5/2)(x + 1/2 + V5/2) in R[x] and C[x). 
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